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ABSTRACT 


Response  surface  methodology  (RSM)  is  used  for  optimality  analysis  of  the  cost 
parameters  in  mixed  integer  linear  programming.  This  optimality  analysis  goes  beyond 
traditional  sensitivity  and  parametric  analysis  in  allowing  investigation  of  the  optimal 
objective  function  value  response  over  pre-specified  ranges  on  multiple  problem 
parameters.  Design  of  experiments  and  least  squares  regression  are  used  to  indicate  which 
cost  parameters  have  the  greatest  impact  on  the  optimal  objective  function  value — total 
cost — and  to  approximate  the  optimal  total  cost  surface  over  the  specified  ranges  on  the 
parameters.  The  mixed  integer  linear  programming  problems  of  interest  are  the  large-scale 
problems  in  supply  chain  optimization — also  known  as  facility  location  and  allocation 
problems.  Furthermore,  this  optimality  analysis  technique  applies  to  optimality  analysis  of 
costs  or  right-hand-side  elements  in  continuous  linear  programs  and  optimality  analysis  of 
costs  in  mixed  of  pure  integer  linear  programs.  A  system  which  automates  this  process  for 
supply  chain  optimization  at  PFS  Logistics  Consulting  is  also  detailed,  along  with 
description  of  its  application  and  impact  in  their  daily  operations. 

Keywords:  Optimality  Analysis,  Response  Surface  Methodology,  Design  of  Experiments, 
Group  Screening  Experimental  Design,  Mixed  Integer  Linear  Programming,  Supply  Chain 
Optimization,  Metamodels. 
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CHAPTER  1 


The  research  documented  in  this  thesis  was  sponsored  by  PFS  Logistics  Consulting 
(PFS  LC),  A  Division  of  PepsiCo,  Inc.  Analysts  at  PFS  LC  assist  organizations  within  the 
PepsiCo  family  and  external  clients  in  improving  their  logistics,  or  supply  chain,  stmcture. 
These  projects  are  called  supply  chain  optimizations  and  the  insights  they  provide  can  save 
millions  of  dollars  for  the  chent  organizations.  In  an  effort  to  streamhne  the  supply  chain 
analysis  process  and  provide  added  value  to  their  customers,  PFS  LC  decision  makers 
sought  a  tool  for  optimality  analysis  of  the  cost  data  used  in  their  supply  chain 
optimization  models.  Optimality  analysis  is  the  broad  category  of  approaches  used  in 
mathematical  programming  to  assess  how  the  optimal  solution  is  affected  by  changes  in 
the  model  parameters.  It  includes  the  traditional  sensitivity  analysis  and  parametric 
analysis  techniques  as  well  as  approaches  which  allow  you  to  define  a  range  of 
investigation  on  each  of  many  parameters  and  investigate  the  optimal  objective  function 
value  surface  over  the  defined  ranges. 

Our  goal  was  to  provide  them  with  a  tool  that  could  indicate  the  costs  with  the 
greatest  influence  on  the  optimal  solution  in  a  reliable,  accessible,  expeditious  fashion  and 
enhance  their  optimahty  analysis  capabilities  overall.  We  have  done  that  in  a  PC-based 
system  which  we  call  the  Cost  Impact  Assessment  Tool,  or  CIAT.  CIAT  is  a  loosely 
connected  suite  of  Turbo  Pascal  routines.  Excel  spreadsheets  and  macros,  and  IMP 
statistical  software  designed  to  work  with  the  SAILS™  supply  chain  optimization 
software. 

The  remainder  of  this  thesis  explains  the  theoretical  underpinnings  of  CIAT  and  the 
method  for  its  implementation.  Textual  copies  of  the  Turbo  Pascal  routines  and  Excel 
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macros,  a  description  of  SAILS™  and  IMP  as  they  are  used  in  Cl  AT,  and  the  base  Excel 
spreadsheet  which  serves  as  a  user  interface  are  included.  The  thesis  is  organized  so  that 
Chapter  2  is  a  stand-alone  article  on  our  research,  suitable  for  submission  to  an  academic 
journal.  Chapter  3  oudines  some  obvious  extensions  to  this  work.  The  appendices 
contain  the  details  of  Cl  AT  and  its  theoretical  background  as  follows.  Appendix  A 
describes  the  PFS  LC  environment  and  details  the  concerns  which  led  to  the  development 
of  CIAT.  Appendix  B  contains  an  extensive  literature  review  which  includes  explanation 
of  the  theory  behind  the  ODS  solver  engine  used  in  SAILS™  and  the  techniques  applied  in 
CIAT.  Appendix  C  begins  with  a  brief  introduction  to  the  SAILS™  package  and  includes 
a  description  of  the  input  and  output  files  in  the  SAILS™  modeling  environment. 
Appendix  D  is  a  detailed  oudine  of  the  process  used  when  CIAT  is  applied  to  a  supply 
chain  optimization  project.  Appendix  E  reveals  the  basic  Excel  spreadsheet  which  serves 
as  a  user  interface  for  CIAT  and  the  Excel  macros  used  to  automate  some  steps  in  a  CIAT 
application.  Finally,  Appendix  F  contains  the  text  of  Turbo  Pascal  routines  used  in  CIAT. 
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CHAPTER  2 


2.1.  INTRODUCTION 

Supply  chain  optimization  saves  millions  of  dollars  each  year  for  a  wide  array  of 
industries.  Virtually  any  corporation  that  obtains  raw  materials,  produces  consumer 
goods,  and  delivers  them  to  market  is  concerned  with  the  efficiency  of  their  logistics 
network,  or  supply  chain.  Groups  such  as  PFS  Logistics  Consulting  (PFS  LC),  A  Division 
of  PepsiCo,  Inc.,  specialize  in  supply  chain  optimization — developing  models  of  a 
business’  logistics  and  recommending  changes  to  improve  or  optimize  efficiency. 

Whenever  a  company  contracts  a  supply  chain  optimization  project  with  PFS  LC, 
their  work  follows  the  timeline  pictured  in  Figure  2.1  below.  (Barlaz,  1996;  Clarkson, 
1996;  Deviprasad,  1996;  Hansen,  1996;  Villareal,  1996) 


OPTIMIZA'nON 

MODEL  &SENSraVnT 

DESIGN  Validation  recommendations 


Data  Collectig 

N  &  Analysis 

Me 

BU 

)DEL 

ILDING 

Fig.  2.1.  Sample  timeline  for  supply  chain  optimization  project. 

We  focus  on  two  stages  in  the  timeline — ^Data  Collection  &  Analysis,  and 
Optimization  &  Sensitivity.  The  Data  Collection  &  Analysis  stage  clearly  requires 
the  most  time  in  a  project;  thus,  we  concentrate  on  providing  a  system  for  streamlining  this 
stage.  The  system  developed — which  we  call  the  Cost  Impact  Assessment  Tool,  or 
CIAT — also  applies  to  the  sensitivity  analysis  excursions  in  the  OPTIMIZATION  & 
Sensitivity  stage.  This  article  gives  the  theoretical  foundation  for  CIAT  and  outlines  its 
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use.  It  also  describes  differences  between  the  two  stages,  and  illustrates  the  use  of  CIAT 
through  a  Data  Collection  &  Analysis  scenario  and  an  Optimization  &  Sensitivity 
scenario  performed  on  a  sample  supply  chain. 

2.2.  PERTINENT  LITERATURE 

2.2.1.  Supply  Chain  Optimization. 

Supply  chain  optimization  is  explored  extensively  in  the  mathematical 
programming  literature.  (Sengupta  and  Turnbull,  1996;  Amtzen,  Brown,  Harrison,  and 
Trafton,  1995;  Geoffrion  and  Powers,  1993;  Geoffrion  and  Graves,  1974)  Although  the 
term  supply  chain  was  coined  in  the  1990’s,  these  problems  have  been  formulated  and 
solved  for  decades.  Sometimes  referred  to  as  the  facility  location  and  allocation  problem, 
the  supply  chain  optimization  problem  is  generally  modeled  as  a  large-scale  mixed  integer 
linear  programming  problem.  (Francis  and  White,  1974;  Love,  Morris,  and  Wesolowsky, 
1988)  Although  economies  of  scale  can  be  approximated  with  a  piece-wise  linear 
objective  function,  when  they  are  expected  to  be  significant  the  model  is  revised  to 
incorporate  a  nonlinear  objective  function.  Due  to  the  large  number  of  continuous  and 
binary  variables  typically  involved  in  a  model  of  this  sort,  developing  efficient  solution 
techniques  is  another  fertile  area  for  research.  Methods  such  as  factorization  and  Benders’ 
Decomposition  are  proposed  and  incorporated  into  solution  engines.  (Geoffrion  and 
Graves,  1974;  Brown,  Graves,  and  Honczarenko,  1987) 

2.2.2.  Optimality  Analysis. 

Optimality  analysis  is  another  fruitful  area  for  research  in  mathematical 
programming.  Optimality  analysis  is  the  broad  category  of  techniques  for  investigating 
how  the  optimal  solution  of  a  mathematical  program  is  affected  by  changes  in  the  model 
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parameters.  It  includes  the  traditional  sensitivity  and  parametric  analysis  techniques.  Due 
to  the  necessary  and  sufficient  conditions  associated  with  an  optimal  solution  to  a 
(continuous)  linear  programming  problem,  the  sensitivity  of  the  solution  to  problem  data  is 
easy  to  compute  and  is  often  provided  directly  by  the  solver  employed.  (Bazaraa,  Jarvis, 
and  Sherali,  1990;  Winston,  1994;  Ward  and  Wendell,  1990)  Techniques  are  also 
available  which  show  how  the  optimal  solution  changes  over  specified  ranges  on  all  of  the 
problem  parameters.  A  particularly  applicable  approach  uses  designed  experiments  to 
specify  an  efficient  list  of  problem  parameter  settings  for  investigation.  Then,  response 
surface  methodology  (RSM)  is  used  to  develop  a  mathematical  model  of  the  surface  based 
on  the  solutions  generated  for  each  scenario  in  the  list.  (P.  W.  Smith,  1975;  Johnson, 
Bauer,  Moore,  and  Grant,  1996) 

Unfortunately,  the  discrete  nature  of  the  feasible  solution  space  for  a  mixed  integer 
linear  programming  problem  complicates  the  issue  of  sensitivity  analysis.  Therefore,  there 
is  less  literature  on  sensitivity  analysis  approaches  for  mixed  integer  linear  programming. 
(Geoffrion  and  Nauss,  1977;  Nauss,  1979)  We  will  show  that  techniques  used  for  analysis 
of  (continuous)  linear  programming  problems  can  also  be  applied  in  mixed  integer  linear 
programming.  Specifically,  the  design  of  experiments/response  surface  methodology 
(DOE/RSM)  approach  can  describe  the  sensitivity  of  the  optimal  objective  function  value 
(cost)  with  respect  to  changes  in  the  cost  parameters. 

2.3.  SPECIFIC  PROBLEM 

Returning  to  the  project  timeline  illustrated  in  Figure  2.1,  we  describe  each  stage 
and  explain  the  need  for  our  system  in  the  Data  Collection  &  Analysis  and 
Optimization  &  Sensitivity  stages. 
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2.3.1.  Model  Design. 

This  first  stage  in  the  project  timeline  involves  sketching  out  the  network 
configuration  of  the  business’  supply  chain,  and  deciding  what  level  of  detail  is  appropriate 
for  the  analysis.  The  network  representation  for  a  simple  supply  chain  is  given  in  Figure 
2.2.  In  the  Model  Design  stage,  the  analyst  determines  which  facilities  are  in  the 
network  and  how  those  facilities  are  connected;  determines  the  importance  of  individual 
production  lines;  and  decides  what  specific  questions  are  to  be  answered  by  the  analysis. 
Some  of  the  questions  typically  answered  through  a  supply  chain  optimization  project  are; 
•“How  many  distribution  centers  (DCs)  are  in  the  best  (optimal)  solution?”, 
•“Where  should  we  place  our  DCs?”, 

•“Which  DCs  should  each  production  facility  feed?”, 

•“Which  customer  zones  should  be  supplied  by  which  DCs,  and  to  what  degree?”, 
•“How  many  plants  are  in  the  optimal  solution?”, 

•  “Where  should  the  plants  be  located?”, 

•“How  should  products  be  allocated  to  plants?”,  and 
•“Where  should  raw  materials  be  acquired?”. 

Although  analysts  at  PFS  LC  insist  that  the  supply  chain  must  be  considered  as  a 
whole,  rather  than  by  focusing  on  its  individual  components  or  echelons,  our  CIAT 
prototype  was  designed  to  focus  on  the  DC  echelon.  (Clarkson,  1997)  The  DCs  are  a 
good  starting  place  since  they  often  provide  a  great  opportunity  for  savings.  Moreover, 
the  client  company  is  often  interested  in  making  changes  to  their  DCs  even  before  the  PFS 
LC  analyst  steps  in.  (Barlaz,  1996;  Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996; 
Villareal,  1996) 
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Raw  Mat.  Production  Distribution  Customer 

Vendor  Facilities  Centers  Z^s 


Fig.  2.2.  Supply  chain  network  representation. 


2.3.2.  Data  Collection  and  Analysis. 

At  this  time,  data  such  as  costs  and  distances  are  developed  for  the  network 
model.  Although  this  sounds  like  a  simple  endeavor,  many  of  the  data  elements  required 
are  not  readily  available.  Accounting  systems  used  by  the  client  company  may  make  it 
difficult  to  isolate  the  variable  cost  for  space  at  a  current  distribution  center.  For  instance, 
the  accountant’s  fixed  cost  on  a  warehouse  facility  likely  includes  all  space  related  costs 
(i.e.  rent).  However,  for  strategic  modeling  purposes,  analysts  at  PFS  LC  believe  that  a 
portion  of  the  rent  should  be  considered  a  variable  cost  as  a  function  of  facility  capacity. 
(Barlaz,  1996;  Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996;  Villareal,  1996) 

Also,  supply  chain  optimization  often  evaluates  proposed  facilities  which  do  not 
yet  exist.  Therefore,  the  costs  for  these  proposed  facilities  must  be  approximated  based 
upon  the  expected  scale  of  the  facility  and  its  location.  In  these  cases,  the  costs  must  be 
“engineered”  from  a  zero  base,  which  is  a  time  consuming  process.  (Clarkson,  1997) 
Another  approach  is  to  conduct  a  regression  analysis,  with  facility  costs  modeled  as  a 
function  of  facility  size.  In  this  case,  extensive  research  may  be  required  to  obtain  data  for 
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building  the  regression  model.  (Barlaz,  1996;  Clarkson,  1996;  Deviprasad,  1996;  Hansen, 
1996;  Villareal,  1996) 

2.3.3.  Model  Building. 

In  this  stage,  the  data  and  network  representation  of  the  supply  chain  are 
combined.  The  analyst  translates  the  network  configuration  and  data  into  a  mathematical 
model  suitable  for  solution  by  a  mathematical  programming  solver.  The  model  generally 
begins  in  an  unconstrained  form,  with  the  analyst  incrementally  increasing  the  detail, 
through  addition  of  model  constraints,  until  the  analyst  and  client  are  satisfied  with  the 
model’s  level  of  fidehty.  This  stage  may  be  revisited  later  if  shortfalls  in  the  model  are 
discovered,  or  if  contacts  in  the  chent  company  are  not  satisfied  with  the  level  of  detail 
included.  (Barlaz,  1996;  Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996;  ViUareal, 
1996) 

2.3.4.  Validation. 

This  stage  ensures  that  the  model  appropriately  reflects  the  real  supply  chain.  This 
generally  involves  running  the  model  with  historical  data  to  verify  that  the  solutions  it 
returns  are  acceptable  in  hght  of  the  actual  system  behavior  observed,  with  further 
refinements  made  as  departures  from  reality  are  noted.  Again,  with  the  client  company 
involved,  the  project  does  not  proceed  without  mutual  satisfaction.  (Barlaz,  1996; 
Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996;  Villareal,  1996) 

2.3.5.  Optimization  and  Sensitivity. 

Finally,  useful  results  are  obtained  in  the  supply  chain  optimization  project.  The 
current  or  projected  data  are  used  in  the  validated  model  to  determine  what  network 
configuration  is  best,  and  what  the  expected  cost  and  profit  figures  are  for  this 


2-6 


configuration.  This  stage  may  also  involve  consideration  of  customer  service  standards 
for  the  selected  network,  and  comparisons  to  current  networks  or  competing 
configurations  of  interest  to  the  decision  makers.  Then,  scenarios  are  run  to  test  the 
sensitivity  of  the  solution  to  changes  in  the  input  data.  (Barlaz,  1996;  Clarkson,  1996; 
Deviprasad,  1996;  Hansen,  1996;  Villareal,  1996) 

2.3.6.  Recommendations. 

In  this  final  stage,  results  are  prepared  for  presentation  to  the  decision  makers. 
This  stage  includes  determination  of  alternative  solutions  which  may  be  preferable  to  the 
decision  maker  for  reasons  not  captured  in  the  optimization  model.  Also,  if  many 
sweeping  changes  are  suggested  by  the  optimal  solution,  the  analyst  creates  prioritized 
lists  of  the  changes  in  order  to  facilitate  a  gradual  transition  and  reap  the  greatest  savings 
for  the  client.  (Barlaz,  1996;  Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996;  Villareal, 
1996) 

The  entire  process  provides  an  excellent  opportunity  for  the  client  company  to  step 
back  and  evaluate  the  way  it  does  business.  The  greatest  benefit  occurs  if  contacts  within 
the  client  company  continue  to  be  energized  and  motivated  throughout  the  project 
timeline.  Unfortunately,  the  rigors  of  gathering  the  data  requested  by  PFS  LC  analysts  in 
the  second  stage,  coupled  with  associated  delays  in  the  project,  makes  it  difficult  to 
maintain  client  enthusiasm.  Analysts  at  PFS  LC  need  a  tool  to  focus  and  streamline  the 
Data  Collection  &  Analysis  stage  of  their  analysis,  in  order  to  prevent  enthusiasm 
from  waning.  A  tool  for  broadening  their  optimality  analysis  capabilities  for  use  in  the 
Optimization  &  Sensitivity  stage  would  also  increase  client  involvement.  (Barlaz,  1996; 
Clarkson,  1996;  Deviprasad,  1996;  Hansen,  1996;  Villareal,  1996) 
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2.4.  METHODOLOGY 


2.4.1.  Theoretical  Basis. 

Since  RSM  fits  a  smooth  mathematical  function  to  the  results  of  a  designed 
experiment,  it  is  not  automatically  thought  of  as  a  means  to  express  the  optimal  solution  of 
a  mixed  integer  linear  programming  problem,  which  has  discrete  jumps  in  its  feasible 
solution  space.  However,  if  the  objective  function  cost  parameters  are  used  as  the  factors 
in  the  experimental  design  and  the  optimal  objective  function  value  (total  cost)  is  viewed 
as  the  response,  a  second-order  response  surface  can  provide  an  appropriate 
representation. 

The  quadratic  models  of  RSM  have  been  used  for  optimality  analysis  of  both  the 
objective  function  coefficients  and  the  demands  and  resource  levels  of  the  right-hand  side 
vector  in  a  continuous  linear  programming  context.  (P.  W.  Smith,  1975;  Johnson,  Bauer, 
Moore,  and  Grant,1996)  Through  the  following  theoretical  development,  we  show  that 
RSM  can  also  be  applied  to  optimality  analysis  of  the  cost  parameters  in  a  mixed  integer 
linear  programming  problem. 

Lemma  2.1.  The  optimal  value  function  of  the  cost  coefficients  in  an  integer  linear 
programming  problem  is  continuous,  piece-wise  linear,  and  concave.  (Geofftion  and 
Nauss,  1977:459) 

Lemma  2.2.  (The  Weierstrass  Approximation  Theorem)  “Let /be  real-valued  and 
continuous  on  a  compact  interval  [a,  b].  Then  for  every  e  >  0  there  is  a  polynomial p 
(which  may  depend  on  £)  such  that  \f(x)-p(x)\  <  e  for  every  jc  in  [a,  6].”  (Apostol, 

1974:322) 
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Theorem  2.1.  The  optimal  value  function  of  a  mixed  integer  linear  program  as  a 
function  of  its  cost  parameters, /(rj,  can  be  approximated  by  a  polynomial  over  pre- 
specified  ranges  in  the  cost  parameters. 

Proof.  The  result  follows  directly  from  the  lemmas.  Our  pre-specified  ranges  in 
the  cost  parameter  form  a  hypercube.  This  hypercube  is  compact  since  any  closed, 
bounded  interval  on  the  real  numbers  is  compact  and  an  intersection  of  compact  sets  is 
compact.  By  Lemma  2.1,  f(t)  is  continuous.  Hence,  by  application  of  Lemma  2.2,  f(t)  can 
be  approximated  by  a  polynomial  over  this  pre-specified  hypercube.  ■ 

Furthermore,  since  the  true  optimal  objective  function  value  surface  is  known  to  be 
concave,  we  would  expect  a  linear  or  quadratic  function  to  provide  an  acceptable 
approximation.  However,  since  the  true  optimal  value  function  is  faceted  rather  than 
smooth  (i.e.  continuously  differentiable),  the  approximation  can  never  be  exact.  (Johnson, 
Bauer,  Moore,  and  Grant, 1996:49)  It  is  critical  to  note  that  the  optimal  value  function  of 
the  right-hand-side  parameters  in  a  mixed  integer  linear  programming  problem  does  not 
meet  these  requirements,  so  Theorem  2.1.  does  not  justify  applying  an  RSM  technique  for 
optimality  analysis  of  these  parameters. 

In  cases  where  a  large  number  of  cost  factors  are  uncertain,  statistical  techniques 
such  as  group  screening  and  specialized  experimental  designs  can  identify  the  most 
important  cost  coefficients.  (See  Box  and  Draper,  1987;  Box,  Hunter,  and  Hunter,  1978; 
or  Montgomery,  1976  for  a  solid  background  on  experimental  design.)  Due  to  our  unique 
knowledge  of  the  response  behavior  relative  to  changes  in  the  cost  coefficients,  group 
screening  is  a  particularly  useful  way  of  targeting  the  most  influential  costs.  (Watson, 

1961;  Patel,  1962) 
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In  group  screening,  related  factors  are  varied  simultaneously  as  a  unified  group 
factor  in  the  experimental  design.  One  major  concern  is  that  grouping  costs  together 
could  mask  individual  factor  effects  on  the  response  due  to  cancellation.  (Kleijnen,  1987a; 
Kleijnen,  1987b;  Kleijnen,  1975;  Mauro,  1986;  Mauro,  1983;  D.E.  Smith  andMauro, 
1982)  For  example,  suppose  that  Factor  A  causes  a  tremendous  increase  in  the  response. 
We  say  that  Factor  A  has  a  positive  effect  on  the  response.  If  Factor  A  is  grouped  with 
other  factors  that  contribute  a  proportional  decrease  in  the  response  (i.e.  have  a  negative 
effect  on  the  response),  that  group  would  be  incorrectly  deemed  unimportant.  This 
cancellation  can  never  occur  in  our  context,  since  the  following  theorem  shows  that  no 
cost  coefficient  can  have  a  negative  effect  on  the  optimal  total  cost  which  is  our  response. 

Theorem  2.2.  Increasing  a  cost  coefficient  value  can  never  have  a  negative  effect 
on  the  optimal  total  cost  of  a  (mixed  integer)  linear  program  with  minimization  objective. 

Proof.  The  proof  is  by  contradiction.  Suppose  the  theorem  were  not  valid,  and 
that  cost  N  (on  variable  N)  has  a  negative  effect  on  the  optimal  total  cost.  There  are 
exactly  two  possibilities  for  cost  N. 

Case  i:  Variable  N  is  non-basic  before  cost  N  is  increased.  Since  cost  N 
has  a  negative  effect  on  the  optimal  total  cost,  increasing  cost  N  from  its  original 
value  caused  a  decrease  in  the  optimal  total  cost.  Yet,  if  variable  N  is  non-basic  in 
the  original  solution,  increasing  cost  N  will  not  cause  variable  N  to  become  basic. 
Hence,  cost  N  will  have  no  effect  on  the  optimal  total  cost. 

Case  ii:  Variable  N  is  basic  before  cost  N  is  increased.  Once  cost  N  is 
increased,  two  things  may  happen.  Either  variable  N  will  remain  basic  or  it  wiU  be 
replaced  in  the  basis  by  variable  R.  If  variable  N  remains  basic,  then  its  increased 
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cost  causes  an  increase  in  the  optimal  total  cost.  If  variable  R  replaces  variable  N 
in  the  optimal  solution  once  cost  N  is  increased,  this  is  because  use  of  variable  R  is 
now  less  costly  than  use  of  variable  N.  However,  if  variable  R  were  less  costly 
than  variable  N  before  the  increase,  variable  N  would  not  have  been  in  the  basis 
before  the  cost  increase.  Hence,  the  optimal  total  cost  cannot  decrease  in  this  case 
either.  ■ 

A  final  concern  with  the  use  of  group  screening  in  this  optimality  analysis  context 
is  detection  of  quadratic  effects.  Without  some  method  to  gauge  curvature,  a  group  of 
factors  which  contribute  a  quadratic  effect  about  the  design  center  would  also  be 
incorrectly  labeled  insignificant.  Unfortunately,  since  our  underlying  model  is  the 
deterministic  mathematical  program,  the  standard  statistical  tests  for  curvature  or  lack  of 
linear  fit  are  not  appropriate. 

Instead,  we  can  use  the  R^  value  to  indicate  the  level  of  curvature  present  when  a 
linear  regression  model  has  been  fitted  to  the  data.  R^,  the  ratio  of  the  sum  of  squares 
explained  by  the  model  to  the  total  corrected  sum  of  squares,  reflects  the  proportion  of  the 
variation  of  the  response  values  explained  by  the  factor  levels  in  the  model.  Since  the  only 
error  present  in  our  model  is  due  to  lack  of  fit  of  the  linear  regression  model,  R^  decreases 
with  increased  curvature.  No  curvature  could  be  measured  without  the  inclusion  of  some 
number  of  center  point  rows,  with  all  factors  set  at  the  midpoint  of  their  specified  range. 
We  now  turn  to  the  question  of  how  the  center  point  response  should  be  weighted  or 
duplicated  in  the  regression  model,  in  order  for  R^  to  properly  reflect  the  presence  or  lack 
of  curvature. 
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Since  the  underlying  system  is  deterministic,  one  objection  to  including  multiple 
center  points  is  that  we  are  adding  information  that  is  already  included  in  the  model. 
However,  varying  the  number  of  center  point  rows  simply  controls  the  relative  weight  of 
the  responses  presented  when  fitting  the  regression  model.  For  instance,  if  a  single  center 
point  row  is  used  in  all  cases,  will  not  provide  a  consistent  assessment  of  curvature  in 
the  underlying  response.  Including  a  single  center  point  row  regardless  of  the 
experimental  design  ensures  that  the  value  will  be  notably  lower  in  a  full  factorial 
design  than  in  its  fractionated  counterparts.  This  occurs  because  the  single  center  point 
response  is  a  smaller  proportion  of  the  entire  data  table  in  a  full  factorial  design. 

Therefore,  in  order  to  provide  a  gauge  which  reflects  curvature  equally  across  design 
choices,  the  center  point  response  is  weighted  equally  across  designs  by  including  copies 
of  the  center  point  row  as  a  function  of  the  design  used. 

Since  we  want  to  assess  curvature  in  each  dimension,  it  makes  sense  to  include  a 
center  point  row  for  each  factor  in  a  full  factorial  design.  In  a  fractional  factorial  design, 
the  number  of  center  point  rows  should  also  be  fractionated  so  that  the  responses 
observed  at  comer  points  and  the  center  point  are  weighted  as  they  would  be  in  the  full 
factorial  design.  For  example,  where  three  center  points  are  included  in  a  full  factorial 
design  on  three  factors  (eight  comer  points),  the  half  factorial  design  on  three  factors  (four 
corner  points)  suggests  one  and  one-half  center  points,  with  one  or  two  actually  used. 

Another  alternative  for  gauging  curvature  is  to  build  a  regression  model  without 
inclusion  of  any  center  points  and  calculate  the  prediction  error  at  the  center  point.  This 
error  reflects  the  dollar  error  present  in  total  cost  (optimal  objective  function  value)  due  to 
using  the  first-order  regression  model.  Therefore,  a  quadratic  model  should  be  employed 
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if  the  center  point  prediction  error  is  unacceptably  large.  The  addition  of  center  points  to 
the  model  affects  only  the  intercept  term  in  the  regression  equation;  thus,  the  parameters 
on  the  factors  indicate  the  relative  importance  of  each  factor  or  group,  regardless  of  which 
approach  is  used  to  measure  curvature. 

If  either  of  the  two  methods  for  assessing  curvature  indicate  the  first-order 
regression  model  is  an  insufficient  representation  of  the  optimal  objective  function  value 
response  surface,  then  a  second-order  regression  model  and  experimental  design  are 
appropriate.  The  additional  optimization  runs  required  by  the  selected  Box-Benken  design 
or  face-centered  cube  central  composite  design  are  submitted  to  the  mathematical 
programming  solver  and  a  quadratic  model  is  fit  using  least  squares  regression.  (Note  that 
we  are  not  concerned  with  blocking  effects  since  the  system  is  deterministic.) 

2.4.2.  Practical  Considerations. 

The  DOE/RSM  approach  to  sensitivity  analysis  of  the  cost  coefficients  in  a  mixed 
integer  linear  program  is  fairly  straightforward  in  its  application,  as  shown  in  the  flow 
chart  at  Figure  2.3.  First,  the  cost  parameters  are  selected  and  upper  and  lower  bounds 
are  specified  for  the  cost  parameters  of  interest. 

If  thirty-one  or  fewer  costs  are  selected,  they  are  used  as  factors  in  a  two-level 
experimental  design.  PC  software  packages  are  available  which  list  the  number  of 
scenarios  required  for  a  full  factorial  experiment  on  a  specified  number  of  factors,  or  the 
associated  fractional  factorial  or  Plackett-Burman  designs.  We  use  the  IMP  statistical 
package  produced  by  the  SAS  Institute,  Inc.  Once  a  design  is  selected  from  the  list  of 
choices,  IMP  produces  a  table  of  -1,  +1,  and  0  coded  factor  settings  for  that  design. 
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Extract  costs  from 
SAILS  input  files  Intd 
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Generate  design 
table  in  JMP  for 
factors. 


(approx.  10  sec.) 


(approx.  3  min.) 


• 

Drop  groups  which 

* 

have  relatively  small 

sum  of  squares. 

(approx.  5  micuy. 


preak  remaining  groups 
Into  smaller  groups  or 
Individual  costs  for  use 
as  factors. 


pring  coded  design  table] 
into  "Work  Space"  & 
calculate  true  settings| 
for  design  runs. 


(approx.  5  min.) 


Create  SAILS  files 
for  runs  In  design 
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(approx.  5-10  min.) 


Load  &  launchi 
I  SAILS  runs.  I 


(approx.  2  min.) 


Extract  response  valuea 
|from  SAILS  _.mln  files  4 
add  to  coded  table  in 
JMP. _ 

Analyze  model  w/ 
linear  least  squares 
regression. 


4There  is  a  problem  dependent 
time  lapse  here  while  the  SAILS 
solver  runs.) 


(approx.  5-10  min.) 


(approx.  5  min.) 


(approx.  5  min.) 


Generate 
second-order 
design  table  In  JMP. 


(approx.  10  sec.) 


Determine  cost 
\drivers  by  proportion^ 
of  sum  of  squares,  t 


Use  rows  which 
have  not  been  run  as 
new  design  table. 


(approx.  20  sec.) 


Fig.  2.3.  Flow  chart  for  CIAT  application. 

Each  row  in  the  table  specifies  the  cost  settings  of  each  factor  for  a  single  model 
scenario — with  -1  denoting  the  lower  bound,  +1  the  upper  bound,  and  0  the  midpoint. 
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If  more  than  thirty-one  costs  are  selected  for  a  sensitivity  analysis,  a  grouping 
strategy  is  employed.  In  this  case,  groups  of  related  costs  serve  as  factors  in  the 
experimental  design.  Thus  a  -1  setting  for  a  group  factor  means  that  all  costs  in  the  group 
should  be  set  at  their  lower  bounds  for  that  scenario;  similarly,  the  +1  and  0  entries  for  the 
group  factor  require  setting  all  costs  in  the  group  to  their  upper  bounds  and  midpoint 
values,  respectively. 

Once  an  experimental  design  has  been  generated  for  the  factors  selected,  the 
scenarios  in  that  design  are  submitted  to  the  mixed  integer  linear  programming  solver.  A 
center  point  run — all  factors  at  their  0  level — ^must  be  included  as  a  scenario  in  order  to 
assess  the  presence  of  curvature.  We  apply  a  solver  which  is  used  by  PFS  LC.  Produced 
by  INSIGHT,  Inc.,  and  called  SAILS™-ODS,  it  is  one  of  two  solvers  in  a  package 
tailored  to  supply  chain  optimization. 

In  the  following  passages,  we  mention  the  optimal  objective  function  value  which 
is  used  as  the  response  in  our  designs.  Since  the  SAILS™-ODS  solver  actually  iterates  on 
the  solution  until  a  specified  tolerance  on  optimality  is  attained,  the  resulting  objective 
function  value  is  more  accurately  labeled  “near-optimal”.  However,  this  is  a  practical 
reality  with  virtually  any  computer  optimization  package,  and  is  not  a  primary  issue  in  our 
research  so  we  will  continue  to  refer  to  the  solutions  SAILS™ -ODS  provides  as 
“optimal”. 

The  optimal  solution  is  obtained  from  SAILS™-ODS  for  each  scenario,  with  its 
associated  optimal  cost  added  to  the  experimental  design  table.  Before  a  sum  of  squares 
analysis  is  performed  on  the  design  table  and  optimal  costs  observed,  the  center  point  row 
should  be  replicated  in  the  table  one  time  for  each  factor  in  the  design.  Since  the  data  in 
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the  table  is  deterministic,  the  center  point  rows  are  used  to  provide  a  measure  of  curvature 
present  in  the  response  surface  (indicating  that  additional  rows  should  be  added  to  the 
design  table  and  the  results  of  the  associated  optimization  runs  used  to  estimate  a  higher- 
order  functional  form  for  the  relationship  of  the  optimal  cost  to  the  factors). 

With  the  appropriate  number  of  center  point  rows  in  the  data  table,  the  statistical 
package  which  generated  the  design  is  used  to  perform  a  sum  of  squares  analysis  on  the 
data,  through  a  linear  regression  or  ANOVA  procedure  with  the  optimal  cost  as  the 
response  variable  and  the  factor  columns  as  the  inputs.  The  software  gives  the  model 
corrected  sum  of  squares,  the  sum  of  squares  for  each  factor,  and  the  value  for  the 
model. 

The  proportion  of  the  model  sum  of  squares  contributed  by  each  factor  is  used  to 
determine  which  factors  most  influence  the  response  (optimal  total  cost).  If  a  grouping 
strategy  is  in  use,  factors  which  contribute  a  very  small  proportion  of  the  model  sum  of 
squares  are  dropped  from  consideration,  and  the  group  factors  contributing  a  large 
proportion  are  broken  into  smaller  groups  of  factors  which  are  used  in  a  new  design.  The 
R  value  provided  can  be  used  to  determine  if  a  quadratic  model  is  justified. 

2.5.  IMPLEMENTATION 

2.5.1.  Background. 

As  mentioned  in  Section  2.3,  the  two  stages  in  a  supply  chain  optimization  project 
which  benefit  from  the  DOE/RSM  approach  are  Data  Collection  &  Analysis  and 
Optimization  &  Sensitivity.  Potential  differences  between  the  two  areas  of 
implementation  are  summarized  in  Table  2.1. 
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Table  2.1.  Differences  between  two  timeline  stages  of  application 
Data  Collection  &  Analysis  Stage  Optimization  &  Sensitivity  Stage 

•  Early  in  the  process.  •  Late  in  the  process. 

•  Focused  on  specific  costs.  •  Broad-stroke  analysis  on  system. 

•  Goal:  Streamline  data  collection  work.  •  Goal:  Provide  insight  for  presentation  and 

focus  excursions  performed  in  this  stage. 
Since  each  supply  chain  optimization  project  is  slightly  different,  the  applications 
of  CIAT  used  in  practice  will  more  likely  fall  along  a  spectrum  between  the  two  columns 
in  Table  2.1.  For  instance,  if  there  is  considerable  uncertainty  about  a  large  number  of 
costs,  CIAT  application  in  the  Data  Collection  &  Analysis  stage  may  more  resemble 
the  process  we  will  describe  as  an  OPTIMIZATION  &  SENSITIVITY  application  in  that 
grouping  strategies  will  be  employed.  However,  designing  CIAT  to  handle  the  entire 
spectrum  of  applications  requires  a  clear  delineation  of  the  distinctions  shown  in  Table  2.1. 

Ranges  on  Costs.  In  DATA  COLLECTION  &  ANALYSIS,  the  concern  is  with  a  small 
collection  of  costs  that  are  subject  to  error  or  difficult  to  obtain.  Therefore,  the  ranges  on 
these  costs  are  generally  set  to  reflect  the  level  of  uncertainty  associated  with  each 
individual  cost  value.  In  contrast,  the  analyst  often  wants  to  vary  many  of  the  model  costs 
in  the  Optimization  &  Sensitivity  stage  by  some  fixed  percentage  of  their  value  to 
provide  a  comfort  level  about  the  results. 

Need  for  Grouping.  Another  associated  difference  is  the  use  of  grouping. 

Analysis  in  the  OPTIMIZATION  &  SENSITIVITY  stage  more  likely  involves  large  numbers  of 
cost  parameters.  However,  there  are  cases  with  widespread  uncertainty  in  the  cost  data 
resulting  in  the  need  for  grouping  in  the  DATA  COLLECTION  &  ANALYSIS  stage. 


2-17 


Number  and  Nature  of  Responses  Investigated.  The  nature  of  the  responses 
investigated  and  functional  form  used  will  also  differ  between  the  two  stages.  In  the  Data 
Collection  &  Analysis  stage,  the  goal  is  to  gauge  the  importance  of  individual  costs  to 
the  bottom  line.  As  a  result,  the  optimal  total  cost  is  generally  the  only  response  of 
interest.  In  contrast,  in  the  OPTIMIZATION  &  Sensitivity  stage,  additional  responses  such 
as  the  number  of  distribution  centers  in  the  optimal  solution  or  the  total  distance  in  the 
distribution  network  may  be  considered. 

Fit  Required  of  Regression  Model.  In  Data  Collection  &  Analysis 
investigations,  the  analyst  will  probably  stick  to  a  linear  design  and  model  unless  the 
value  is  quite  low  (say  below  0.8).  In  Optimization  &  Sensitivity  investigations,  a  high 
fidelity  functional  relationship  between  a  selected  group  of  cost  parameters  and  the 
optimal  total  cost  may  be  desired  for  presentation  purposes. 

2.5.2.  Sample  Supply  Chain. 

PFS  LC  provided  a  sample  supply  chain  problem  along  with  the  associated  input 
data  files  for  the  SAELS^'^-ODS  solver.  The  single-product  supply  chain  is  composed  of 
four  production  facilities  (plants),  seventeen  current  and  twenty-seven  proposed 
distribution  centers  (DCs),  and  378  customers.  The  underlying  geography  is  western 
Pennsylvania  and  portions  of  West  Virginia  and  New  York. 

This  supply  chain  optimization  problem  focuses  on  selecting  the  most  efficient 
collection  of  distribution  centers  and  routes  through  them.  Distribution  center  fixed  costs 
are  those  costs  incurred  simply  by  opening  the  facility,  regardless  of  throughput.  Variable 
costs  for  each  distribution  center  are  variable  with  respect  to  throughput  volume.  Thus, 
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any  cost  which  can  be  calculated  per  cubic  foot  of  warehouse  space  or  in  time  spent 
handling  product  is  included  in  variable  costs. 

The  two  sections  that  follow  explain  how  CIAT  could  be  applied  to  this  sample 
supply  chain  in  the  two  stages  of  the  project  timeline — ^Data  Collection  &  Analysis 
and  Optimization  &  Sensitivity. 

2.5.3.  Scenario  I:  Application  in  Data  Collection  &  Analysis. 

All  costs  associated  with  the  proposed  DCs  are  ranged  to  investigate  their  impact 
on  the  optimal  objective  function  value.  The  proposed  DCs  are  selected  because  their 
costs  are  particularly  subject  to  error  and  time-consuming  to  refine.  For  the  twenty-seven 
proposed  DCs  this  amounts  to  twenty-seven  fixed  costs  and  twenty-seven  variable  costs. 

A  range  of  plus  and  minus  twenty  percent  is  specified  on  the  variable  costs  and  a  range  of 
plus  and  minus  ten  percent  is  used  on  the  fixed  costs.  Since  JMP  does  not  generate  a 
screening  design  for  fifty-four  factors,  the  fixed  costs  are  grouped  as  factor  1  and  the 
variable  costs  as  factor  2  in  the  first  stage  of  a  screening  process. 

The  two  group  factors  are  investigated  through  a  full  factorial  design  with  four 
comer  points  and  two  center  points  included.  The  regression  model  fit  is  first-order  with 
an  interaction  term.  The  value  is  0.958  and  the  adjusted  is  0.895,  so  the  curvature  is 
determined  to  be  negligible.  The  model  sum  of  squares  is  124184.75  with  factor  1  (the 
fixed  cost  group)  contributing  104006.25,  or  83.75%.  Thus,  the  variable  costs  are 
dropped  from  the  investigation  and  the  twenty-seven  fixed  costs  are  retained. 

Since  only  twenty-seven  costs  remained,  we  could  proceed  to  individual  cost 
factors  in  a  twenty-eight  run  Plackett-Burman  experiment  with  additional  center  point(s). 
Another  option  would  be  to  regroup  the  fixed  costs  and  eliminate  more  costs  before 
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testing  them  individually.  Proceeding  to  the  Plackett-Burman  would  provide  a  more 
sensitive  assessment  of  all  the  fixed  costs  and  avoid  the  necessity  of  another  iteration 
through  the  process,  but  using  another  grouping  stage  could  save  several  SAILS™  runs 
overall. 

When  we  proceed  to  the  Plackett-Burman,  it  is  so  highly  fractionated,  that  only 
one  center  point  run  is  included.  As  a  result,  the  value  is  1.000  and  the  adjusted  is 
undefined.  Using  our  bottom-line  method  of  assessing  curvature,  the  center  point  run  is 
not  included  in  the  regression.  Then  the  prediction  error  at  that  point  is  evaluated  in  terms 
of  a  tolerance  on  error  in  the  bottom-line  (optimal  total  cost).  In  this  case,  the  prediction 
error  amounts  to  $29,960  in  a  bottom  line  of  approximately  $58,000,(X)0,  so  it  is 
considered  negligible.  The  regression  model  fit  is  a  simple  linear  model,  with  no 
interaction  terms.  The  model  sum  of  squares  is  220974.96,  with  five  of  the  cost  factors 
contributing  at  a  notably  higher  level  than  any  other  costs.  These  five  proposed  DCs  are 
depicted  in  Figure  2.4  by  letters,  while  the  other  proposed  DCs  are  shown  as  blocks.  The 
greatest  contributor  is  at  Mercer  (R)  with  a  sum  of  squares  of  95238.89  (42.19%) 
followed  by  State  College  (S)  with  51000.89  (23.08%),  Bradford  (B)  and  Warren  (W) 
with  25260.04  (11.43%)  each,  and  Meadville  (A)  with  12900.04  (5.84%).  No  other  fixed 
cost  contribute  more  than  505,75  to  the  model  sum  of  squares.  Hence,  it  is  our 
conclusion  that  since  the  optimal  total  cost  solution  is  most  sensitive  to  changes  in  these 
five  fixed  costs  over  the  ranges  specified,  data  refinement  resources  should  be  focused  on 
these  five  facilities. 

To  further  investigate  the  relationships  between  these  five  fixed  costs  and  the 
optimal  system  cost,  we  use  a  second-order  central  composite  design  on  these  five  factors. 
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with  the  axial  points  set  on  the  face  of  the  hyper-cube  (that  is,  the  axial  settings  are  at  the 
coded  -1  and  +1  levels,  requiring  only  the  upper  and  lower  bound  values  specified).  This 
design  requires  thirty-three  experimental  runs.  However,  since  seven  of  these  are  center 
point  rows,  we  really  only  need  to  perform  the  remaining  twenty-six  optimization  runs, 
then  add  the  center  point  rows  from  our  previous  results.  Most  Data  Collection  & 
Analysis  CIAT  applications  would  stop  before  fitting  a  second-order  regression  model. 
In  some  cases,  though,  this  regression  model  can  provide  useful  insight  into  the  system. 


Fig.  2.4.  Results  of  Data  Collection  &  Analysis  scenario. 


We  find  that  fitting  the  second-order  model  certainly  provides  interesting  results 
in  this  case.  First,  the  second-order  regression  model  reveals  a  substantial  interaction 
between  the  Warren  and  Bradford  fixed  costs.  This  interaction  contributes  14460.06  to 
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the  model  sum  of  squares  242544.19.  Furthermore,  the  quadratic  term  in  the  regression 
model  reflecting  the  cost  at  Warren  clearly  defines  a  convex  surface  (Figure  2.4). 

However,  in  Section  2.4.1  we  state  that  the  optimal  value  function  of  a  mixed  integer 
linear  program  is  concave.  This  contradiction  is  explained  by  considering  the  optimality 
tolerance  used  in  SAILS'^“-ODS. 

Recall  that  our  response  is  actually  “near-optimal”  due  to  the  iterative  method  used 
by  the  solver  to  tackle  these  computationally  complex  problems.  In  a  central  composite 
design  without  replications,  there  are  exactly  three  points  used  to  estimate  the  form  of 
each  of  the  quadratic  terms  in  the  regression  model.  The  incorrect  quadratic  term  in 
Figure  2.4  occurs  because  the  response  values  with  Warren’s  fixed  cost  at  both  the  -10% 
and  midpoint  values  (and  all  other  costs  are  held  at  their  midpoint)  is  $57,978,000,  while 
the  response  value  with  Warren’s  fixed  cost  at  the  +10%  level  is  $58,091,000.  The  value 
at  -10%  is  the  likely  culprit,  with  the  solver  stopping  at  this  value  when  a  slightly  lower 
response  value  exists  as  the  true  optimal. 

There  are  two  possible  courses  of  action  when  something  like  this  occurs.  The 
first  alternative  is  to  re-run  the  entire  design,  or  at  least  the  disputed  points,  with  a  tighter 
optimality  tolerance  set.  We  chose  a  second  alternative,  which  involves  running  axial 
points  outside  the  hyper-cube  boundary.  The  underlying  idea  is  that  by  extending  our 
sample  range  we  will  more  likely  capture  the  concave  nature  of  the  optimal  objective 
function  value.  As  shown  in  Figure  2.5,  pushing  these  axial  points  further  from  the  center 
illuminated  the  true  concave  nature  of  the  response. 
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Fig.  2.5.  Revised  results  of  Data  Collection  &  Analysis  scenario. 

2.5.4.  Scenario  2:  Application  in  Optimization  &  Sensitivity. 

The  potential  variations  in  applying  CIAT  in  this  stage  are  virtually  unlimited. 
Depending  upon  the  particular  problem,  the  analyst  may  want  to  investigate  the 
relationships  between  DC  costs,  transportation  costs,  and  the  response  optimal  system 
cost.  Also,  the  analyst  may  want  to  consider  alternative  responses  such  as  total  network 
transportation  mileage  or  some  customer  service  measures.  Grouping  can  be  used  to 
investigate  the  sensitivity  of  the  solution  to  cost  variation  along  geographic  lines  or  by 
managerial  units. 

We  perform  an  OPTIMIZATION  &  SENSITIVITY  excursion  on  the  sample  supply 
chain  optimization  problem  provided.  Our  goal  is  to  investigate  how  fixed  and  variable 
distribution  center  costs  affect  the  optimal  network  solution.  Since  eighty-eight  costs  are 
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investigated,  a  geographical  grouping  strategy  is  employed.  A  grid  is  specified  in  latitude 
and  longitude  as  shown  in  Figure  2.6.  The  fixed  and  variable  DC  costs  are  grouped 
according  to  the  grid.  The  letters  in  Figure  2.6  label  each  group,  and  the  numbers  reflect 
the  number  of  DCs  in  each  group. 

In  this  scenario,  we  are  not  grouping  costs  in  order  to  screen  out  those  with  little 
impact.  Rather,  we  are  placing  costs  into  groups  because  we  would  expect  their  future 
increases  and  decreases  to  be  correlated  (i.e.  cost  increases  in  Pittsburgh  will  likely  affect 
all  facilities  in  the  Pittsburgh  area).  The  goal  is  to  investigate  the  robustness  of  our  model 
to  variations  in  costs  by  looking  at  the  persistence  of  our  recommended  solution, 
simultaneously  gaining  insight  into  alternative  solutions  which  would  not  otherwise  have 
been  explored. 

For  the  first  stage  in  this  scenario,  ranges  of  plus  and  minus  10%  are  set  on  the 
fixed  and  variable  costs  of  all  forty-four  DCs.  Since  these  costs  are  grouped  into  eleven 
zones  according  to  Figure  2.6,  a  Plackett-Burman  design  with  twelve  comer  point  runs  is 
appropriate.  In  order  to  more  thoroughly  sample  the  design  space,  we  adopt  the  foldover 
technique  from  experimental  design  literature  (see  Myers  and  Montgomery,  1995:172- 
173),  by  multiplying  each  coded  factor  in  the  Plackett-Burman  design  by  -1,  giving  a 
twenty-four  comer  point  design.  A  center  point  optimization  mn  is  also  performed,  since 
we  are  interested  in  knowing  the  active  DCs  in  each  solution,  rather  than  the  optimal 
system  cost  collected  in  the  Data  Collection  &  Analysis  scenario.  In  the  second  set 
of  twenty-five  runs,  the  same  coded  design  table  is  used  with  costs  ranging  30%  in  both 
directions. 
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Fig.  2.6.  Zones  used  in  OPTIMIZATION  &  Sensitivity  scenario. 

For  each  set  of  runs,  tables  and  mosaic  plots  are  generated  to  investigate  how  the 
active  DCs  in  the  solution  are  affected  by  cost  changes.  The  most  valuable  information  for 
the  client  appears  in  Table  2.2  and  Table  2.3.  These  crosstabulations  show  the  frequency 
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of  each  solution  network  and  of  the  individual  DCs  in  the  solutions.  The  DC  numbering 
scheme  is  in  order  of  appearance  in  the  SAILS^m-ODS  input  files.  DCs  numbered  1-17 


are  existing  facilities,  while  those  numbered  18-44  are  candidate  sites.  The  solution 
recommended  by  the  baseline  optimization  (at  the  design  center)  and  its  DC  components 
are  denoted  by  *.  Note  that  this  solution  and  its  components  are  persistent  in  the  presence 
of  10%  and  30%  variation  in  the  cost  data.  However,  other  DCs  are  also  fairly  persistent 
and  merit  further  investigation  as  possible  alternative  recommendations. 

Table  2.2.  Persistence  of  solutions  and  DCs  in  the  presence  of  ±10%  cost  variations . 


* 


DC\Solulian 

2-12-17 

2-12-16-21  2-12  2-17 

2-17-18 

2-1617-21 

2-17-34  2-18 

2-18-21 

Sum  %ofSdutiGns 

*2 

9 

4 

3 

2 

2 

2 

1 

1 

1 

25 

100% 

*12 

9 

4 

3 

16 

64% 

16 

4 

2 

6 

m 

*11 

9 

2 

2 

2 

1 

16 

64% 

18 

2 

1 

1 

4 

16% 

21 

4 

2 

1 

7 

28% 

34 

1 

1 

4% 

Total 

9 

4 

3 

2 

2 

2 

1 

1 

1 

25\75 

Table  2.3.  Persistence  of  solutions  and  DCs  in  the  presence  of  ±30%  cost  variations. 


* 


DC  \  Solution 

2-12-17  2-18-21  2-21-23 

Other  Solutions  (single  occurrences) 

Sum 

%  of  Solutions 

*2 

3 

3 

2 

8 

16 

64% 

4 

6 

6 

24% 

1 

5 

5 

20% 

*12 

3 

3 

6 

24% 

16 

2 

2 

8% 

*17 

3 

8 

11 

44% 

18 

3 

3 

6 

24% 

21 

3 

2 

10 

15 

60% 

23 

2 

5 

7 

28% 

25 

1 

1 

4% 

26 

3 

3 

12% 

34 

3 

3 

12% 

35 

1 

1 

4% 

37 

1 

1 

4% 

2.6.  RESULTS  AND  CONCLUSIONS 

Although  CIAT  has  not  been  used  at  PFS  LC  yet,  they  are  very  enthusiastic  about 
its  potential.  They  were  also  excited  about  the  level  of  automation  present  in  CIAT  and 
its  potential  extensions  to  include  investigation  of  other  costs  and  response  variables. 

Many  of  their  projects  involve  multi-billion  dollar  decisions,  so  every  bit  of  useful 
information  can  have  a  tremendous  impact.  Particularly  useful  are  the  alternative  solutions 
and  information  on  solution  persistence.  Moreover,  the  excursions  performed  in 
Optimization  &  Sensitiyity  can  be  tailored  to  the  client.  The  potential  savings  provided 
in  the  DaTa  COLLECTION  &  Analysis  stage  is  also  substantial.  For  each  day  saved  in  this 
stage,  $1000  of  analyst  time  is  saved,  and  the  client’s  time  is  saved  as  weU. 
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CHAPTER  3 


Now  that  the  CIAT  prototype  is  in  place  at  PFS  LC,  there  are  a  few  obvious 
extensions  for  the  research  contained  in  this  thesis  and  CIAT  itself.  The  first  extensions 
will  occur  at  PFS  LC  by  the  analysts  and  computer  support  people  they  employ.  It 
involves  duplicating  and  modifying  Excel  macros  and  Turbo  Pascal  routines  so  that  other 
types  of  costs  and  responses  can  be  incorporated  in  CIAT  applications. 

The  second  area  for  extension  may  also  occur  through  use  of  CIAT  at  PFS  LC. 
This  entails  developing  general  guidelines  for  grouping  of  cost  factors  in  CIAT 
applications.  Some  grouping  patterns  will  prove  more  efficient  in  that  they  tend  to  require 
fewer  runs  in  order  to  isolate  cost  drivers.  However,  the  best  way  to  identify  these 
general  grouping  guidelines  is  through  empirical  observation. 

The  final  recommended  extension  is  more  theoretical  in  nature.  It  involves 
investigating  or  developing  experimental  designs  most  suited  for  use  in  the  OPTIMIZATION 
&  Sensitivity  stage.  The  best  design  for  application  in  this  area  would  investigate  the 
greatest  number  of  independent  vectors  from  the  design  center  in  a  minimal  number  of 
runs.  It  might  also  be  preferable  to  focus  entirely  on  vectors  where  no  costs  are 
decreased,  since  the  decision  maker  is  usually  most  concerned  with  how  combinations  of 
cost  increases  may  affect  the  network  solution. 
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APPENDIX  A:  PROBLEM  ENVIRONMENT 


Supply  chain  management  describes  the  process  of  procuring  the  necessary 
raw  materials,  producing  products,  and  distributing  them  to  customer  zones.  There  is  an 
implicit  network  configuration  to  this  process,  as  demonstrated  in  Figure  A.l.  The 
objective  of  supply  chain  management  is  to  design  and  utihze  such  networks  at  minimum 
cost. 


Raw  Mat.  Production  Distribution  Customer 

Vendor  Facilities  Centers  Zones 


Fig.  A.I.  A  simple  supply  chain. 


Supply  chain  issues  fall  into  three  categories  based  upon  their  scope  and  time- 
frame.  Operational  issues  are  those  dealing  with  the  details  of  day-to-day  operations,  such 
as  assigning  a  specific  vehicle  to  a  hst  of  deliveries.  Tactical  issues  are  short-range  plans, 
usually  with  a  time  horizon  of  less  than  one  year,  such  as  choosing  production  schedules 
to  meet  expected  demands.  Strategic  issues  are  long-range  plans  for  the  supply  chain, 
particularly  decisions  on  the  structure  of  the  chain.  (Barlaz,  1996;  Concepts:  A  Handbook 
for  SAILS™  Users,  Volume  1, 1995:1-4;  Ganeshan  and  Harrison,  1995:2) 

Supply  chain  optimization  uses  mathematical  programming  techniques  to  answer 
the  questions  in  strategic  supply  chain  management: 
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•“How  many  distribution  centers  (DCs)  are  in  the  best  (optimal)  solution?”, 
•“Where  should  we  place  our  DCs?”, 

•“Which  DCs  should  each  production  facility  feed?”, 

•“Which  customer  zones  should  be  supplied  by  which  DCs,  and  to  what  degree?”, 
and  sometimes 

•“How  many  plants  are  in  the  optimal  solution?”, 

•  “Where  should  the  plants  be  located?”, 

•“How  should  products  be  allocated  to  plants?”,  and 
•“Where  should  raw  materials  be  acquired?”. 

The  mathematical  programs  formulated  to  answer  these  questions  are  large-scale 
mixed  integer  linear  programs  (MIPs)  with  specialized  structure.  There  is  such 
widespread  interest  in  supply  chain  optimization  in  today’s  corporations  that  specialized 
software  packages  have  been  developed  to  solve  these  MIPs.  In  fact,  consulting  agencies 
have  developed  an  expertise  in  supply  chain  analysis  and  proper  use  of  these  optimization 
packages.  PFS  Logistics  Consulting,  A  Division  of  PepsiCo,  Inc.,  (PFS  LC)  is  one  of 
these  agencies. 

PFS  management  recognized  their  company  needed  this  type  of  analysis  early  in 
the  development  of  supply  chain  optimization.  As  a  result,  analysts  at  PFS  began  using  a 
package  called  SABLS''^'^  (Strategic  Analysis  of  Integrated  Logistics  Systems)  while  it  was 
stiU  in  development,  and  PFS  LC  was  bom.  Today,  analysts  at  PFS  LC  periodically 
evaluate  distribution  stmctures  within  their  organization  to  find  areas  for  improved 
efficiency.  They  also  solve  problems  for  external  clients  who  are  contemplating  a  change 
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in  their  supply  chain  or  who  simply  want  to  evaluate  their  current  way  of  doing  business. 
(Barlaz,  1996;  Clarkson,  1996;  Hansen,  1996;  Villareal,  1996) 

Whatever  the  project,  PFS  LC  performs  the  following  six  stages  of  analysis,  with 
approximate  time  required  given  parenthetically  (Hansen,  1996): 

Stage  1.  Model  Design  (2-3  days) 

Stage  2.  Data  Collection  &  Analysis  (1-2.5  months) 

Stage  3.  Model  Building  (2  weeks) 

Stage  4.  Validation  (1  week) 

Stage  5.  Optimization  &  Sensitivity 

Stage  6.  Recommendations  (3-5  weeks  for  stages  5  and  6  combined) 
Stage  2,  Data  Collection  &  Analysis,  takes  the  largest  portion  of  the  project’s  total 
time.  This  is  not  surprising  since  the  data,  cost  factors  and  location  information  drive  the 
optimization  results. 

Based  upon  conversations  with  analysts  at  PFS  LC,  cost  data  used  in  the  model  is 
particularly  time  consuming  to  acquire  and  subject  to  error.  Since  some  sites  which  are  to 
be  evaluated  do  not  even  exist  at  the  time  of  the  analysis,  these  costs  are  “engineered” 
from  a  zero  base  or  by  use  of  regression  analysis  of  costs  as  a  function  of  facility  size. 

Even  with  sites  already  in  operation,  it  may  be  difficult  to  properly  categorize  costs  as 
fixed  or  variable  with  the  size  of  the  facility,  due  to  inconsistant  accounting  procedures. 
(Barlaz,  1996;  Clarkson,  1996;  Hansen,  1996;  Villareal,  1996)  Clients  plan  to  implement 
changes  recommended  in  Stage  6  with  potentially  significant  impacts  on  their  bottom  line, 
so  they  naturally  want  all  data  to  be  very  accurate.  Also,  if  the  initial  recommendations 
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seem  particularly  counter-intuitive  or  unacceptable  to  the  client,  the  process  returns  to 
Model  Design  and  Data  Collection  &  Analysis  to  isolate  and  correct 
inconsistencies.  This  may  improve  the  results  of  the  optimization  or  provide  a  stronger 
base  of  support  for  the  previous  results.  (Barlaz,  1996;  Clarkson,  1996;  Hansen,  1996; 
Villareal,  1996)  Consequently,  those  in  charge  at  PFS  LC  see  an  opportunity  for  process 
improvement  in  these  six  project  stages.  Specifically,  they  would  like  to  pare  down  the 
time  required  for  Stage  2  and  perhaps  begin  to  communicate  preliminary  findings  to  their 
clients  in  this  stage  in  order  to  correct  potential  inconsistencies  earlier  in  the  process. 
(Barlaz,  1996;  Clarkson,  1996;  Hansen,  1996;  Villareal,  1996) 

The  objective  of  this  research  is  to  assist  them  in  their  efforts  to  pare  down  the 
Data  Collection  &  Analysis  stage  and  enhance  client  involvement  and  satisfaction. 
Since  initial  estimates  of  costs,  demands,  and  distances  used  in  the  model  are  often  readily 
available,  an  initial  model  run  could  use  these  values  for  a  rough  cut  solution.  Then  use  of 
a  technique  which  identifies  the  parameters  with  the  greatest  impact  on  the  solution  would 
allow  analysts  at  PFS  LC  to  focus  time  and  effort  in  Stage  2  on  these  significant 
parameters.  Essentially  what  they  need  is  a  tool  and  procedure  for  performing  optimality 
analysis  on  the  parameters  in  these  large-scale  MIPs. 

As  a  result  of  these  practical  issues  -  in  addition  to  several  theoretical 
considerations  which  make  optimahty  analysis  of  objective  function  parameters  more 
straightforward  than  sensitivity  of  technological  coefficients  and  right-hand-side  vectors  - 
this  research  wiU  focus  on  the  optimality  analysis  of  the  cost  coefficients. 
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APPENDIX  B:  LITERATURE  REVIEW 
B.l.  SUPPLY  CHAIN  OPTIMIZATION 

Although  the  term  supply  chain  management  had  not  yet  been  coined  in  1974,  in 
an  article  published  that  year,  Geoffrion  and  Graves  present  a  now  standard  formulation 
for  the  basic  strategic  supply  chain  optimization  problem  and  explain  a  decomposition 
technique  which  solves  these  large-scale  mixed  integer  programs  with  relative  efficiency. 
Below,  I  discuss  their  article  in  great  detail  since  the  model  formulation  and  solution 
technique  presented  by  Geoffrion  and  Graves  are  the  foundation  of  the  SAILS^^-ODS 
solver  engine,  one  of  two  in  the  software  package  used  at  PFS  LC. 

One  major  assumption  in  Geoffrion  and  Graves’  formulation  makes  the  solution 
technique  possible  -  they  require  that  each  customer  zone  is  supplied  by  a  single 
distribution  center  (DC).  (They  assert  that  this  requirement  often  holds  in  practice.) 
Consequently,  if  sole-sourcing  is  inappropriate,  this  formulation  and  solution  approach 
should  not  be  used.  (In  these  cases  analysts  at  PFS  LC  would  employ  the  SAILS 
OPTIMA  solver,  which  does  not  make  this  restriction.) 

Before  presenting  their  formulation,  the  following  parameters  and  variables  must 
be  defined. 

Indices:  i  for  commodities 

j  for  plants 

k  for  potential  DCs 

/  for  customer  zones 

Parameters:  Sy  commodity  i  production  capacity  at  plant  j 
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D,7  commodity  i  demand  at  customer  zone  I 

Ym,  Yk  lower  and  upper  bounds  on  annual  throughput  at  DC  k 

fjfc  annual  fixed  cost  for  attaining  and  operating  DC  k 

Wk  variable  per  unit  cost  on  throughput  at  DC  k 

Cijki  average  per  unit  production  and  shipping  cost  of 

commodity  i  produced  at  plant  j  and  sent  through  DC  k  to 
customer  zone  / 

Variables:  Xy«  real- valued  and  non-negative  amount  of  commodity  / 

shipped  from  plant  j  through  DC  k  to  customer  zone  I 
Yki  a  binary  variable  with  value  1  if  DC  k  serves  customer  zone 

I,  0  otherwise 

Zk  a  binary  variable  with  value  1  if  a  DC  is  needed  at  site  0  if 
the  solution  does  not  include  a  DC  at  site  k. 

The  problem  formulation  is  then: 

Min  GjklXjkl  fkZk  -(-V)c2i ;;  Di/yfa] 

s.t.  (1)  Xw  ^  Sij,Vij,  (Capacity  Constraints) 

(2)  '^jXjki=  Diyki,Viki,  (Demand  Constraints) 

(MCDS) 

(3)  ^i,yid  =  l,Vi,  (Sole  -  sourcing  Constraints) 

(4)  VkZk  <  ^i/Diyid  <  UZk,Vk,  (DC  Throughput  Constraints) 

(5)  other  linear  constraints  on  y  and  z  as  needed, 

where  (MCDS)  stands  for  Multi-Commodity  Distribution  System.  (Geoffrion  and  Graves, 

1974:823) 
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An  explanation  of  the  objective  function  and  constraints  of  this  model  is  in  order. 
The  objective  is  to  minimize  the  sum  of  production  and  shipping  costs  ),  fixed 

costs  ( X((  ),  and  variable  costs  incurred  by  operating  the  required  DCs  at  the 

determined  capacity  (  Xk  ).  The  constraints  at  (1)  ensure  that  production 

capacity  for  a  given  commodity  is  not  exceeded  at  any  plant.  The  constraints  at  (2)  force 
demand  for  a  given  commodity  to  be  met  exactly  at  each  customer  zone.  They  also 
restrict  available  delivery  routes  to  those  containing  a  DC  activated  to  serve  the  given 
customer  zone,  since  y«  is  zero  unless  DC  k  serves  customer  zone  /.  If  these  constraints 
were  formulated  as  a  >  inequality,  they  would  not  prohibit  demand  from  being  met  along 
routes  which  are  closed  (i.e.  routes  with  y«  =  0).  The  constraints  at  (3)  enforce  the  sole- 
source  assumption  mentioned  previously;  they  prevent  any  customer  from  being  served  by 
multiple  DCs.  The  constraints  at  (4)  enforce  lower  and  upper  capacity  bounds  on 
throughput  at  each  DC,  and  ensure  that  no  flow  is  allowed  through  DCs  which  are  not 
opened.  The  linear  constraints  at  (5)  can  incorporate  problem-specific  logical 
relationships  in  DC  location  combinations,  sourcing  of  customer  zones  by  DCs,  and 
customer  service  specifications. 

Geoffrion  and  Graves  note  that  their  (MCDS)  formulation  differs  from  previous 
formulations  of  the  warehouse  location  problem.  Earlier  models  use  one  set  of  variables 
for  transportation  of  product  to  the  DCs  and  a  separate  set  for  transportation  of  product 
from  the  DCs  to  the  customer  zones,  instead  of  one  set  of  variables  with  four  subscripts 
(indexing  commodities,  plants,  DCs  and  customer  zones).  Geoffrion  and  Graves  assert 
that  their  formulation  is  more  applicable  to  real-world  instances  of  the  problem,  since  it 
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tracks  the  product  throughout  the  distribution  chain.  As  a  result,  issues  such  as  travel 
time  constraints  on  perishable  goods  or  the  option  to  route  some  product  directly  to 
customers  through  a  fictitious  DC,  can  be  accommodated.  (Geoffiion  and  Graves, 
1974:824) 

Geoffiion  and  Graves  solve  (MCDS)  using  a  variant  of  Benders’  Decomposition. 
The  sole-sourcing  requirement  allows  the  decomposition  into  a  master  problem  in  the 
binary  variables  and  a  standard  linear  transportation  subproblem  for  each  commodity. 
Assuming  a  basic  understanding  of  Benders’  Decomposition  algorithm,  Geoffiion  and 
Graves’  explanation  of  their  implementation  is  hard  to  follow.  Therefore,  a  brief  review  of 
the  general  Benders’  Decomposition  approach  follows. 

This  explanation  of  Benders’  Decomposition  is  taken  from  Optimization  Theory 
for  Large  Systems  by  L.S.  Lasdon  and  from  a  lecture  by  J.  T.  Moore.  OLasdon,  1970:370- 
389;  Moore,  1996) 

Consider  the  following  problem  which  is  linear  in  x  for  fixed  y. 

Min  c'x  +  f  (y) 

s.t.  Ax4-g(y)>b  (P) 

X  >  0,  y  e  S. 

Define  the  set  of  all  feasible  choices  of  y  as  R  =  {y  e  SBx  >  0  3  Ax  >  b-g(y)}.  By 
application  of  Parkas’  Lemma,  y  is  feasible  for  (P)  if  and  only  if  [b  -  g(y)]'u  <  0, 

Vu  3  A'u  <  0.  Since  these  u  vectors  form  a  polyhedral  cone,  we  can  write 
u  =  /l/>0,  where  the  u[  are  the  extreme  rays  of  the  cone 

C  =  {ulA'u  <  0,u  >  0}.  So  we  now  write  R  =  {y  e  Sl[b  -g(y)]'u[  <  0, 
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for  /  =  1,2,..., r>}.  Thus  (P)  becomes  Min (f  (y)  +  Min{c'xlAx  >  b -  g(y),x  >  0} }.  By 

yeR 

the  duality  theorem  of  hnear  programming,  we  can  replace  the  inner  minimization  by  its 
dual  to  obtain  Min{f  (y)  +  Max{[b  -  g(y)]'ulA'u  <  c,u  >  0} }.  If  we  let 

ysR 

uf ,  for  /  =  1,2,...,  Hp,  denote  the  extreme  points  of  the  feasible  region  for  the  inner 


maximization,  then  we  have 
Minz 

s.t.  z> f(y)  +  [b-g(y)]'uf,  for/ =  1,2,- -^np  (MP) 

y  eR  =  {y  G Sl[b - g(y)]X  ^ 0.  for  /  =  1,2,---, A}. 

If  we  can  find  (z*,  y*,  x*)  so  that  (z*,  y*)  solves  (MP)  and  x*  solves 
Min  c'x 

s.t.  Ax>b-g(y*)  (SP*), 

x>0, 

where  the  problem  is  denoted  (SP*)  to  reflect  its  dependence  upon  the  solution  (z*,  y*) 
found  for  (MP),  then  (y*,  x*)  solves  (P). 

Benders’  Decomposition  uses  this  result,  and  the  fact  that  few  of  the  constraints  of 
(MP)  are  actually  binding  at  optimality,  to  define  an  iterative  algorithm  for  this  type  of 
problem.  Specifically,  let  (MPo)  represent  (MP)  with  no  constraints  added,  i.e. 


MinZ 
s.t.  yeS. 


(MPo) 


Then  use  the  dual  to  (SPo) — ^the  problem  resulting  when  (SP*)  incorporates  the  solution 
(z°,  y°)  for  (MPo) — ^to  check  for  optimality  and  add  any  violated  constraints  (called 
Benders’  cuts)  to  (MPo),  and  repeat  the  process  until  reaching  optimality.  At  each 
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iteration  h,  the  optimality  test  relies  upon  the  fact  that  an  optimal  solution  (z*,  y*)  to  (MP/,) 
is  optimal  for  (MP)  if  and  only  if  z  -  f(y*)  equals  the  dual  solution  of  (SPa).  The  dual 
problem  to  (SP*)  is  given  by 

Max  1^=  [b-g(y'')]'u 

u 

s.t.  A'u  <  c  (DP/,) 

u>0. 

If  the  solution  to  (MP/,)  is  not  optimal  for  (MP)  then  construct  (MP/,+7)  by  adding  the 
constraint  [b-g(y'')]'u''  >  Z-f(y''),  and  repeat. 


The  algorithm  is: 

Step  1:  Set  iteration  counter  h  to  0. 

Step  2:  Solve  (MP/,). 

If  (MP/,)  is  infeasible,  then  (P)  is  infeasible. 

Otherwise:  If  (MP/,)  has  a  finite  optimal  solution,  then  denote  that 
solution  by  (z*,y*).  If  (MP/,)  is  unbounded,  then  set  z*  to  negative  infinity 
and  select  some  arbitrary  y*  in  S.  Proceed  to  Step  3. 

Step  3:  Solve  (DP/,). 

If  (DP/,)  is  infeasible,  then  (P)  is  unbounded. 

If  (DP/,)  is  unbounded,  then  go  to  Step  6. 

Otherwise,  denote  the  solution  to  (DP/,)  as  and  the  objective 
function  value  as  w*  and  proceed  to  Step  4. 

Step  4:  Perform  the  optimality  test. 
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If  z*  -  f(y*)  equals  w*,  then  (z*,y*)  is  optimal  for  (MP).  Find  the 
corresponding  optimal  value  for  x  either  by  duality  methods  and  use  of  u*, 
or  by  solving  (SP*). 

Otherwise,  continue  at  Step  5. 

Step  5:  Create  (MP^+i)  by  adding  the  constraint  Z  >  [b  -  g(y'')]'u'’  +  f  (y*")  to 
(MPfe).  Set  htoh+1  and  return  to  Step  2. 

Step  6:  Since  the  problem  at  Step  3  was  unbounded,  we  need  to  determine  the 
extreme  point  u*  and  extreme  ray  v*  such  that  the  objective  function  value 
approaches  infinity  along  the  line  u  =  u*  +  A,v* ,  for  non-negative  values  of  X. 
Then  we  constmct  (MP^+y)  by  adding  the  constraint  [b  -  g(y'’)]'v'’  <  Oto 
(MP,).  If  <  [b-g(y'-)]V  -HfCy"),  then  z>  [b-g(y'’)]V  -hfCy")  should 
also  be  added  to  (MP/,+y).  Set  h  to  h+l  and  return  to  Step  2. 

The  algorithm  must  conclude  in  a  finite  number  of  steps  since  we  would  eventually 
generate  all  of  the  finite  number  of  constraints  to  (MP).  (Lasdon,  1970:370-389;  Moore, 
1996) 

Returning  to  Geoffrion  and  Graves  application  of  Benders’  Decomposition  to 
solve  their  supply  chain  optimization  problem  (MCDS),  in  the  master  problem,  the  binary 
variables  are  set  so  that  constraints  (3)  through  (5)  are  satisfied.  The  problem  can  then  be 
separated  into  a  set  of  standard  linear  transportation  problems,  one  for  each  commodity, 
which  are  combined  to  form  the  Benders’  subproblem.  Solving  the  subproblem  at  each 
iteration  provides  an  upper  bound  on  the  optimal  solution,  since  each  set  of  transportation 
networks  solved  is  a  feasible  solution  for  the  distribution  system.  The  dual  variables  of  the 
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subproblem  solution  are  then  used  to  add  cuts  to  the  binary  master  problem.  Solution  of 
the  master  problem  at  each  iteration  provides  a  lower  hound  on  the  solution,  since  it 
selects  the  lowest  fixed  and  variable  cost  network  configuration  without  all  of  the  capacity 
and  demand  constraints  enforced.  These  upper  and  lower  bounds  converge  with 
increasing  iterations,  and  the  process  stops  when  the  difference  between  the  least  upper 
bound  and  the  greatest  lower  bound  is  less  than  a  specified  tolerance  on  optimality.  The 
algorithm  given  by  Geoffnon  and  Graves  follows.  (Geoffiion  and  Graves,  1970:827-30) 

Step  0:  Set  the  iteration  counter,  H,  to  0.  Select  convergence  tolerance  e  >  0. 

Set  UB  to  infinity  and  LB  to  negative  infinity.  If  a  binary  vector  solution  for  y 
and  z  is  known  which  satisfies  (3)  through  (5),  then  proceed  to  Step  2  with 
(y\  z^)  set  to  the  known  feasible  solution.  Otherwise,  proceed  to  Step  1. 

Step  1:  Solve  master  problem  H, 

Min  Yo  +  2,  J  fkZk  +\4X«  Diiyki] 

s.t.  (3),  (4),  (5)  from  original  problem  formulation,  and, 
ifH>0, 

+  !,•%■,  for  /7=  1,2,---,H 

where  Yo  is  unrestricted  in  sign  and  continuous,  and  and  are  the 
dual  variable  values  for  the  demand  constraints  and  supply  constraints  of 
linear  subproblem  h. 

Step  2:  Denote  the  optimal  solution  found  by  (y^^',  z^'^^  Yo^^^ ).  Set  LB  to  the 
optimal  objective  function  value.  If  UB  is  less  than  or  equal  to  LB  +  e,  stop 
with  the  conclusion  that  the  current  solution  is  e-optimal.  Otherwise  proceed 
to  Step  3. 
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Step  3:  Solve  the  linear  subproblem, 

Min  GikiXiiki 

s.t.  (1)  and  (2)  of  the  original  problem  formulation, 
with  Xjki  real  and  non  -  negative,  with  y  = 

Let  denote  the  optimal  solution  and  T(y^^^)  the  optimal  objective  function 
value.  Then  U  =  X)([  fxZx  +WfXf/  Dnyid]  +  T(y^^')  is  an  upper  bound  on  the 
solution  to  the  original  problem.  If  U  <  UB,  then  replace  UB  with  U,  store 
)  as  the  current  candidate  solution,  and  stop  if  UB  is  less  than 
or  equal  to  LB  +  e,  with  the  current  solution  as  e-optimal.  Otherwise  find  the 
dual  solution  for  this  subproblem  with  y  =  y^'^^  and  denote  it  by 
where  corresponds  to  the  supply  constraints  at  (1)  and  corresponds 
to  the  demand  constraints  at  (2).  Increment  H  by  1  and  return  to  Step  1. 
Geoffrion  and  Graves  also  present  their  practical  experience  with  supply  chain 
optimization  and  explain  how  lessons  learned  from  that  experience  are  incorporated  into 
the  formulation  and  solution  approach  presented.  They  outline  eight  types  of  model  rans 
which  they  see  as  common  to  the  supply  chain  optimization  process. 

•  First  are  the  “probationary  exercises”  used  to  test  the  model  and  begin  its  validation. 

•  Next,  “regional  optimization”  runs  are  completed  for  further  validation  and  to  provide 
first  insights  into  the  distribution  system  solution. 

•  Only  then  is  a  “global  optimization”  run  performed. 

The  final  five  types  of  rans  address  issues  raised  by  the  global  optimization  results  and 
refine  the  information  gained  before  presentation  to  decision  makers. 
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•  “What  if. . scenarios  may  be  questions  triggered  by  the  global  optimization  or  issues 
of  particular  interest  to  the  decision  makers. 

•  “Sensitivity  analysis”  runs  vary  the  input  parameter  values  to  check  on  their  impact  on 
the  optimal  solution. 

•  “Continuity  analysis”  runs  vary  data  in  the  constraints  to  see  when  small  changes 
produce  large  differences  in  the  solution  due  to  the  discrete  nature  of  the  solution 
space. 

•  “Tradeoff  analysis”  runs  consider  objectives  which  may  compete  with  the  minimization 
of  cost,  such  as  customer  service. 

•  Finally,  “priority  analysis”  runs  are  used  to  prioritize  recommendations  in  order  of 
decreasing  savings  potential. 

Geoffrion  and  Graves  explain  that  the  Benders’  Decomposition  solution  technique  is 
particularly  efficient  for  such  a  sequence  of  related  runs.  The  Benders’  cuts  generated  for 
the  original  problem  can  be  adjusted  to  hold  for  small  variations  of  the  problem.  With 
these  modified  cuts  already  in  place,  we  solve  the  new  problem,  often  resulting  in  a  faster 
solution  than  without  the  modified  cuts  included.  (Geoffrion  and  Graves,  1970:831,  833- 
836) 

Note  that  sensitivity  and  continuity  analysis  are  precisely  the  information  required 
by  PFS  LC.  However,  they  would  like  a  systematic  approach  to  this  analysis  which  can  be 
carried  out  early  in  the  analysis  process  and  does  not  involve  running  the  model  for  every 
possible  combination  and  level  of  parameter  change.  For  the  size  problem  dealt  with  at 
PFS  LC,  it  is  not  feasible  to  run  the  tremendous  number  of  scenarios  this  would  involve. 
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Also  of  particular  significance  to  my  thesis  research,  Geoffrion  and  Graves  confine 
continuity  analysis  to  changes  in  the  parameters  of  the  constraints  since  “[c]hanges  in  data 
appearing  only  in  the  objective  function... cannot  lead  to  such  behavior”  as  a  discontinuous 
change  in  the  feasible  region  of  the  problem.  (Geoffrion  and  Graves,  1970:834)  Thus,  the 
cost  coefficients  should  be  more  amenable  to  the  type  of  systematic  sensitivity  analysis  we 
seek  than  would  other  model  parameters. 

Geoffrion  and  Graves  apply  their  formulation  and  solution  approach  to  a  large 
real-world  problem  instance  for  Hunt-Wesson  Foods,  Inc.  In  their  implementation,  they 
chose  not  to  optimize  the  master  problem  at  each  iteration,  but  to  stop  with  a  feasible 
solution  more  than  e  below  that  currently  designated  as  UB.  With  this  modification  of  the 
approach,  the  lower  bound  is  not  needed,  and  the  stopping  criterion  is  now  met  when  no 
feasible  solution  to  the  master  problem  gives  a  value  more  than  e  below  the  current  UB. 
This  sub-optimization  method  saves  computations,  particularly  in  the  early  iterations  when 
we  are  presumably  far  from  our  eventual  solution.  They  were  struck  by  the  low  number  of 
iterations  required  to  achieve  even  tight  tolerances,  generally  3-7  iterations.  They  note 
that  re-optimization  for  the  eight  common  types  of  scenarios  did  not  generally  reduce  the 
number  of  iterations  required,  but  did  provide  computational  savings.  This  real-world 
application  for  Hunt-Wesson,  Inc.  led  them  to  eventual  development  of  SAILS^^'^-ODS,  as 
described  in  the  Geoffiion  and  Powers  article.  (Geoffrion  and  Powers,  1993) 

It  is  interesting  to  note  that  Geoffrion  and  Powers  see  their  SAILS™-Ods 
package  as  the  only  commercial  solver  which  employs  Benders’  Decomposition,  in  spite  of 
the  advantages  to  this  approach  when  the  sole-sourcing  assumption  is  valid.  In  fact, 
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Geoffrion  and  Powers  are  surprised  and  disappointed  to  report  that  heuristic  approaches 
to  supply  chain  design  and  improvement  are  still  more  widely-used  than  optimization 
approaches.  (Geoffrion  and  Powers,  1993:9-10) 

R.L.  Breitman  and  J.M.  Lucas  outline  a  tool  used  by  General  Motors  for  all 
aspects  of  supply  chain  management.  (Breitman  and  Lucas,  1987)  Their  system, 
PLANETS,  is  a  flexible  system  designed  to  model  a  wide  array  of  management  decisions 
made  at  all  levels  of  GM.  PLANETS  was  developed  in  1974  to  enable  GM  decision 
makers  to  formulate  and  solve  mathematical  models  addressing  recurring  questions  and  ad 
hoc  analysis  issues.  The  current  network  of  computer  programs  has  a  user  interface  which 
is  menu-driven  and  uses  conversational  business  terminology  rather  than  mathematical 
programming  lingo.  Once  a  business  situation  of  interest  is  outlined  through  use  of  the 
user- interface,  PLANETS  automatically  formulates  and  solves  the  associated  mixed- 
integer  programming  problem,  using  MPSX  and  SCICONIC  as  the  solver  engines. 

Reports  are  generated  according  to  the  users  selections  from  such  groups  as  “process 
center  utilization”,  “material  usage”,  “product  cost  allocation”,  “product- shipment-by¬ 
facility”,  “investment  summary”,  “facility-revenue-and-cost”,  and  “facility-income-and- 
cash-flow”.  Reports  on  the  sources  of  infeasibility  and  sensitivity  to  parameters  are  also 
available.  (Breitman  and  Lucas,  1987:97-98) 

Breitman  and  Lucas  observe  that  a  typical  PLANETS  problem  formulation  has 
over  10,000  variables,  over  100  of  which  are  restricted  to  integer  values.  Yet,  PLANETS 
formulates  the  problems  and  generates  structured  input  for  the  solver  in  about  one  to  two 
minutes.  (Breitman  and  Lucas,  1987:97) 
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It  is  not  clear  how  the  sensitivity  analysis  report  is  generated  for  such  large-scale 
mixed  integer  programming  problems.  Breitman  and  Lucas  do  not  divulge  the 
methodology  used  to  generate  this  report.  In  fact,  they  do  not  even  state  whether  this 
report  is  built  by  PLANETS  or  is  part  of  the  output  provided  by  the  MPSX  or  SCICONIC 
solver.  Moreover,  we  do  not  know  if  the  sensitivity  report  provides  information  on  how 
much  simultaneous  change  can  occur  in  the  parameters,  or  if  parameters  are  tested  only 
one  at  a  time. 

B.2.  OPTIMALITY  ANALYSIS  IN  LINEAR  AND  INTEGER  PROGRAMMING 

Once  we  solve  a  mathematical  programming  problem,  our  concern  turns  to  how 
small  changes  in  the  problem  data  affect  the  solution.  This  need  arises  because  the 
model’s  parameter  values  are  often  mere  estimates  or  postulates  of  the  tme  but  unknown 
values.  Consequently,  widespread  investigation  has  produced  a  plethora  of  theory  and 
techniques  on  optimality  analysis  in  linear  programming. 

For  example,  we  can  substitute  b  +  9v  for  the  right-hand- side  vector  b,  where  v 
represents  the  direction  of  change  and  0  is  a  nonnegative  scalar.  Starting  with  the  original 
optimal  solution  (at  9  =  0)  we  increase  0  until  loss  of  feasibility  for  the  optimal  basis  at 
some  0  =  01  >  0.  Then  the  dual  simplex  method  is  used  to  find  the  new  optimal  solution 
which  comes  into  effect  at  9i.  The  process  repeats  to  find  the  remaining  breakpoints  02, 

03, . . .,  0w  until  further  increases  in  0  have  no  effect;  i.e.  the  basis  remains  optimal  for 
additional  changes  in  0  or  no  feasible  solution  exists  for  0  >  0^/.  (The  cost  vector  can 
similarly  be  parametrized  and  investigated.)  See  Bazaraa,  Jarvis,  and  Sherali  for  a 
complete  development  of  these  parametric  analysis  techniques.  (Bazaraa,  Jarvis,  and 
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Sherali,  1990:278-301)  Also  see  Ward  and  Wendell  for  a  unified  approach  to  optimality 
analysis  that  generalizes  the  above  approaches  and  presents  the  full  theoretical  basis  for 
these  methods  in  linear  programming.  (Ward  and  Wendell:  1990) 

All  approaches  to  optimality  analysis  -  a  term  encompassing  sensitivity  analysis, 
parametric  analysis,  and  other  techniques  -  explained  so  far  are  rather  limiting  in  that  they 
restrict  parameter  investigation  to  occur  along  a  linear  cut  specified  by  the  direction  vector 
V.  An  alternative  approach  applies  the  statistical  techniques  of  experimental  design  and 
response  surface  methodology  to  constmct  a  surface  mapping  of  the  optimal  objective 
value  as  a  function  of  the  problem  data. 

P.W.  Smith  first  proposed  this  use  of  design  of  experiments  (DOE)  and  response 
surface  methodology  (RSM)  as  an  approach  to  optimality  analysis  of  parameters  in  1975. 
His  methodology  describes  the  effect  of  simultaneous  parameter  variation  in  two  types  of 
mathematical  programming  formulations  of  an  economic  model:  a  linear  programming 
formulation  and  a  formulation  with  linear  constraints  and  a  quadratic  objective  function. 

In  all  formulations  the  variables  used  were  continuous  in  nature.  (P.W.  Smith,  1975) 

Although  the  peculiarities  of  the  economic  models  under  investigation  obscure  his 
general  methodology,  it  is  clear  that  he  focuses  on  simultaneous  but  distinct  changes  in  the 
elements  of  the  right-hand-side  vector,  b.  He  also  uses  a  second-order  experimental 
design,  containing  design  points  required  to  estimate  the  parameters  in  a  quadratic 
response  surface  through  least  squares  regression,  where  the  factors  fixed  at  each  design 
point  are  the  elements  of  b. 

Smith  also  recognizes  four  areas  of  concern  in  his  approach: 
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1.  The  right-hand- side  elements  in  his  problems  are  interdependent,  greatly  restricting  his 
freedom  in  design  selection.  Specifically,  some  settings  of  b,  precluded  certain 
simultaneous  settings  of  bj.  (P.  W.  Smith,  1975:54-55) 

2.  Reahstic  economic  models  contain  too  many  coefficients  for  practical  application  of 
experimental  design.  (P.  W.  Smith,  1975:44) 

3.  The  true  response  surface  is  a  faceted  figure  with  sharp  comers  and  discontinuities 
rather  than  the  smooth  continuous  surface  constmcted  through  use  of  DOE  and  RSM. 
(P.  W.  Smith,  1975:77) 

4.  Experimental  designs  concentrate  on  minimizing  variance  error  rather  than  the  bias 
error  upon  which  his  analysis  focuses.  (P.  W.  Smith,  1975:34,  41-2) 

K.  E.  Johnson  et.  al.  address  items  (2)  and  (3).  (Johnson,  Bauer,  Moore,  and 
Grant,  1996)  They  also  give  a  clear  and  detailed  outline  of  the  approach,  as  summarized 
below. 

The  authors  deal  with  the  general  linear  programming  model 

Minimize  c'x 
subject  to:  Ax  =  b 
x>0, 

where  c  is  an  n- vector  of  specified  cost  coefficients,  A  is  an  mxn  matrix  of  given 
technology  coefficients,  b  is  an  m- vector  of  known  resource  levels  called  the  right-hand- 
side  vector,  and  x  is  an  n- vector  of  variables.  Focusing  on  a  technique  for  optimality 
analysis  of  b,  their  general  outline  of  the  response  surface  methodology  procedure  is: 

Step  1:  Specify  a  high  and  low  range  for  each  element  of  b,  b,.  Code  these  so  that 

the  high  level  is  given  by  z,-  =  4-1  and  the  low  level  by  z,-  =  -1  by  using 
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z/  =  (b,-  -  niidpoint)/half-range.  Use  experimental  design  techniques  to  specify 
the  combinations  of  parameter  levels  (Le.  combinations  of  z,  =  +1  or  z;  =  -1  for 
i  =  1 , 2, . . . ,  m)  to  be  run  in  the  LP.  Let  s  represent  the  number  of  runs 
specified  by  the  chosen  experimental  design. 

Step  2:  Solve  the  LP  for  each  of  the  s  parameter  level  combinations,  or  design 
points,  specified  in  the  experimental  design  from  Step  1.  Record  the  optimal 
objective  function  value  for  each  of  these  design  points  as  y*. 

Step  3:  Perform  least  squares  regression  to  estimate  the  coefficients,  P,  of  the 
proposed  functional  form,  y  =  ZP  +  e,  where  Z  is  an  sy.q  matrix  whose  rows 
give  the  design  points  determined  in  Step  1  along  with  columns  for  and  z,zy 
terms  and  often  a  column  of  I’s  to  give  the  model  an  intercept  term,  y  is  the  s- 
vector  of  responses  measured  in  Step  2,  and  e  is  the  error  due  to  omission  of 
higher  order  terms  in  the  functional  form.  The  resulting  estimate  of  p  is  given 
by  i3  =  (Z'Z)-'Z'y. 

The  estimate  of  p,  gives  the  magnitude  and  direction  of  a  change  in  the  optimal  objective 
function  value  with  a  one  unit  change  in  z,.  In  the  uncoded  variable  space,  we  can  see  that 
P,  gives  the  magnitude  and  direction  of  a  change  in  the  optimal  objective  function  value 
with  a  change  equivalent  to  the  half-range  specified  for  b,. 

The  authors  suggest  extending  their  technique  to  provide  accurate  predictions  of 
the  optimal  objective  function  value  for  the  LP  with  any  specified  levels  of  b  within  the 
ranges  explored.  Normally,  predictions  of  the  response  for  given  values  of  the  predictors 
would  be  obtained  from  a  fitted  regression  model  as  y  =  Zp'  b,  where  Zp  gives  the  coded 
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values  of  the  specified  settings  for  the  predictors.  Recall  that  all  of  the  error  in  e  was  due 
to  lack  of  model  fit.  Thus,  it  is  reasonable  to  believe  that  we  can  calculate  an  estimate  for 
the  lack  of  fit  error  at  Zp,  and  add  this  to  y  in  order  to  obtain  a  more  accurate  prediction. 

The  authors  propose  using  a  geostatistical  technique  called  kriging  for  estimating 
this  lack  of  fit  error  at  Zp,  denoted  £p .  Kriging  essentially  uses  a  weighted  average  of  the 

prediction  errors  at  the  design  points  in  order  to  estimate  ip  ,so  ip  =  w'e,  where  i  is  the 


vector  of  residuals  at  the  s  design  points  and  w  contains  the  selected  weights.  The  errors 
associated  with  design  points  near  Zp  are  weighted  more  heavily  than  those  far  from  Zp. 


Optimal  weights  are  chosen  by  solving 


r 

1 

w 

1' 

0. 

J. 

1 

where  y  is  an  sxs  matrix  with 


(y)ki  equal  to  the  covariance  between  the  ^h  and  /th  design  points,  1  is  an  5- vector  of 
ones,  X.  is  a  Lagrangian  multiplier,  and  jp  is  an  vector  with  (y,)*  equal  to  the  covariance 
between  Zp  and  design  point  k.  (Johnson,  Bauer,  Moore,  and  Grant,  1996:53) 

Responding  to  the  large  number  of  potential  factors  in  a  comprehensive  economic 
model,  Johnson  et.  al.  propose  using  factor  screening  design  techniques  in  the  initial  stages 
of  analysis  on  a  model  with  a  large  number  of  factors.  Once  the  most  significant  factors 
are  identified,  the  rest  of  the  process  continues  as  outlined  above  with  these  important 
factors.  (Johnson,  Bauer,  Moore,  and  Grant,  1996:52) 

Although  discontinuity  and  lack  of  smoothness  noted  by  Smith  remain  a  concern, 
kriging  lessens  the  impact  on  predictions  made  from  the  response  surface  metamodel 
(adding  back  some  constant  estimate  of  misspecification  error  would  not  address  this 
concern).  Since  kriging  gives  more  influence  to  errors  measured  near  the  point  of 
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prediction,  it  should  adjust  the  response  surface  metamodel  in  areas  where  the  smoothing 
is  most  artificial. 

Although  the  concerns  expressed  about  linear  programming  parameters  are  also 
inherent  in  the  realm  of  integer  programming,  the  methods  used  in  continuous  linear 
programming  cannot  be  directly  extended  to  integer  hnear  programs.  Since  integer 
programming  does  not  have  the  same  continuity  and  duality  properties  of  continuous 
linear  programming,  explanations  of  optimality  analysis  theory  and  techniques  are  scarce 
for  integer  programming.  Responding  to  this  need,  A.M.  Geoffrion  and  R.  Nauss  present 
the  limited  theoretical  basis  for  sensitivity  analysis  in  integer  programming.  (Geoffrion  and 
Nauss:  1977)  Since  no  unified  approach  to  sensitivity  analysis  in  this  realm  exists  for 
apphcation  to  all  types  of  problem  parameters,  Geoffrion  and  Nauss  define  two  situations 
of  interest: 

Min(c  +  0  f)'x 

x^O 

s.t.  Ax>b 

(P«) 

>0  >  0  and  integer  Vy, 

O<0  <1, 

and 

Minc'x 

x>0 

s.t.  Ax  >  b  +  0  r 

X  >  0  and  integer  Vy,  (Pe) 

O<0  <1. 

(Geoffrion  and  Nauss,  1977:453) 
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In  (Pe),  changes  in  the  constraints  change  the  feasible  region  of  the  problem,  often  causing 
a  discontinuous  “jump”  in  the  solution.  However,  in  (P®),  changes  in  the  objective 
function  parameters  have  no  effect  on  the  problem’s  feasible  region.  According  to 
Geoffrion  and  Nauss’  Propositon  3.3,  “[t]he  optimal  value  of  (P®)  for  0  <  0  <  1  is 
piecewise-linear,  continuous,  and  concave  on  its  finite  domain.”  (Geoffrion  and  Nauss, 
1977:459)  Denoting  this  optimal  value  function  for  (P®)  as  z(0),  this  proposition  implies 
that  parametric  analysis  of  the  objective  function  coefficients  may  proceed  in  the  same 
fashion  for  integer  programming  as  in  linear  programming.  However,  the  procedure  for 
determining  z(0)  is  not  as  convenient  now  that  pivots  of  a  simplex  tableau  are  no  longer 
sufficient  to  determine  its  breakpoints. 

In  an  integer  programming  context  where  solution  approaches  are  not  continuous 
in  nature,  we  must  solve  the  problem  with  fixed  values  of  0.  Geoffrion  and  Nauss  suggest 
a  procedure  for  selecting  values  of  0  which  most  efficiently  determine  the  breakpoints  of 
z(0).  They  denote  the  upper  and  lower  bounds  of  the  region  of  uncertainty  about  z(0)  as 
UB(0)  and  LB(0).  UB(0)  is  given  by  the  “lower  envelope”  of  c'  x’'  +  0f'  x''  where  x*"  are 
known  feasible  solutions  to  (P®).  LB(0)  is  found  through  linear  interpolation  between 
optimal  values  solved  for  given  values  of  0.  (Geoffrion  and  Nauss,  1977:459-460) 

We  can  see  this  in  Figures  B.l  and  B.2.  Figure  B.l  represents  the  beginning  stages 
of  the  parametric  analysis.  The  problem  has  been  solved  to  optimality  for  0  =  0  and  0  =  1, 
and  the  optimal  objective  function  values  for  these  solutions  are  shown  at  the  points  A  and 
B,  respectively.  Cuts  forming  the  current  upper  bound,  UB(0),  are  found  by  projecting 
the  line  (c  +  0f) '  x*  where  x*‘  is  the  optimal  solution  found  at  i  =  A  or  i  =  B.  This  holds 
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since  in  a  minimization  problem  the  only  reason  the  optimal  solution  would  change  along 
these  cuts  -  i.e.,  as  0  ranges  from  0  at  A  or  1  at  B  -  is  if  a  lower  objective  function  value 
was  found  with  some  solution  other  than  x*'.  The  region  of  uncertainty  surrounding  z(0) 
is  shaded.  Note  that  known  feasible  solutions  for  some  fixed  value  of  0  can  be  used 
similarly  to  add  more  cuts  on  UB(0)  and  reduce  the  region  of  uncertainty. 


'*™***™™' Cut  Optimal  at - Cut  Optimal  at  "  "  "  Lower  Bound 

0=0  0=1 

Fig.  B.l.  Early  stages  of  parametric  analysis. 


Figure  B.2  shows  how  the  region  of  uncertainty  is  reduced  as  optimal  solutions  are 
found  for  values  of  0  in  the  interval  (0,1).  Each  solution  found  is  used  to  add  a  cut  to 
UB(0)  and  a  new  point  for  LB(0),  thus  eliminating  part  of  the  region  of  uncertainty.  Note 
how  markedly  the  region  is  reduced  by  adding  the  cut  which  is  optimal  at  0=0.5  and 
defining  the  new  lower  bound  envelope  as  the  segments  connecting  A  and  B  to  the 
optimal  objective  function  value  at  0=0.5  which  is  labeled  C.  Therefore,  in  order  to  most 
rapidly  eliminate  the  area  of  uncertainty  about  the  optimal  value  function,  we  should 
always  select  the  next  setting  of  0  so  that  UB(0)  -  LB(0)  is  greatest.  (Geoffrion  and 
Nauss,  1977:460)  For  bounded  problems,  the  piece- wise  linear  optimal  value  function 
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z(0)  for  (P®)  contains  a  finite  number  of  segments,  and  can  be  determined  by  solving  for  a 
finite  number  of  values  of  9. 


Fig.  B.2.  Reducing  the  region  of  uncertainty  in  a  parametric  analysis. 

Geoffrion  and  Nauss  also  present  techniques  of  Lagrangian  relaxation  and  other 
methods  for  obtaining  additional  lower  bounds  on  z(6).  They  give  a  corollary  which 
allows  these  bounds  to  be  incorporated  in  the  parametric  analysis  process  outlined  above. 
The  corollary  states  that  the  upper  concave  envelope  formed  by  combining  LB(0)  from  the 
above  process  and  some  other  known  lower  bound  function  of  0  is  another  valid  lower 
bound.  (Geoffrion  and  Nauss,  1977:461) 

B.3.  EXPERIMENTAL  DESIGN  AND  RESPONSE  SURFACE  METHODOLOGY 

The  purpose  of  design  of  experiments  (DOE)  is  to  allow  the  experimenter  to 
adequately  illuminate  his  areas  of  interest  in  an  economical  fashion,  with  a  design 
containing  experimental  runs  so  that  just  enough  data  is  collected  to  test  the  appropriate 
hypotheses.  Response  siuface  methodology  (RSM)  incorporates  designed  experiments 
and  least  squares  regression  to  fit  an  appropriate  functional  form  to  the  data  collected. 
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RSM  generally  begins  with  first-order  designs  and  their  accompanying  models,  used  to 
identify  the  most  significant  factors  and  an  appropriate  region  for  further  investigation. 

The  second  phase  of  RSM  uses  second-order  experimental  designs  to  fit  quadratic 
polynomials  to  the  data,  providing  a  more  detailed  picture  of  the  response  surface.  The 
purpose  of  RSM  may  be  to  illuminate  the  functional  relationship  between  the  factors  set  in 
the  experiment  and  the  response  measured.  RSM  may  also  be  used  to  identify  factor 
settings  which  can  be  used  to  optimize  the  response,  or  achieve  some  pre-specified  levels 
of  one  or  multiple  responses.  (Myers  and  Montgomery,  1995:12)  See  Myers  and 
Montgomery  for  an  introduction  to  RSM. 

A  small  example  of  a  first-order  experimental  design  on  two  factors  is  shown  at 
Figure  B.3. 


Inter 

c  e  p  t 
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+ 
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-  1 

-  1 
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+  1 

-  1 
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-  1 

-1-  1 

+ 
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-1-  1 

-1-  1 

Fig.  B.3.  A  first-order  experimental  design  on  two  factors. 

Denoting  the  design  matrix  as  D,  each  row  of  D  represents  a  design  point  to  be  run 
in  the  experiment,  each  column  gives  the  specified  levels  for  its  respective  factor  in  those 
design  points.  The  intercept  column  is  included  to  provide  a  constant  term  when  a 
regression  model  is  fit  to  this  data.  It  does  not  specify  factor  settings.  The  experiment 
shown  in  Figure  B.3  is  a  two-level  factorial  experiment,  since  each  factor  is  set  at  two 
levels,  -1  for  the  low  level  and  +1  for  the  high  level.  (The  experiment  is  called  factorial 
since  including  all  combinations  of  factor  levels  causes  the  number  of  runs  to  increase  as 
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n  =  L*',  where  L  is  the  number  of  levels  and  k  is  the  number  of  factors.)  Two-level 
factorial  experiments  are  used  for  investigating  the  effects  of  individual  factors  and  factor 
interactions  such  as  AB  on  the  response,  y.  Thus,  a  factorial  experiment  allows  estimation 
of  the  parameters  in  a  functional  form  such  as  /=  -t-  where  xi 

corresponds  to  coded  values  of  factor  A  and  xz  corresponds  to  coded  values  of  factor  B. 
The  parameter  estimates  found  through  least  squares  regression  are  given  by 
p  =  (D'D)-'D'y. 

The  most  important  property  of  a  first-order  design  is  orthogonality.  An 
orthogonal  design  permits  independent  estimation  of  parameters  in  the  response  surface 
functional  form;  i.e.,  each  Pi  is  estimated  so  that  it  remains  unchanged  with  the  addition  or 
deletion  of  other  terms  in  the  regression  model.  Note  that  the  dot-product  of  any  two 
columns  in  Figure  B.3  is  zero,  thus  this  design  is  first-order  orthogonal.  While  the 
definition  of  an  orthogonal  design  matrix  changes  slightly  for  higher-order  designs,  its 
benefits  remain  the  same. 

After  the  proper  region  of  investigation  has  been  identified  and  the  pool  of 
potentially  important  factors  has  been  screened  through  the  use  of  first-order  designs, 
center  point  runs  are  added  to  the  design.  This  augmented  design  allows  for  testing  the 
presence  of  pure  quadratic  terms  such  as  xi^  and  xz.  The  presence  of  these  terms  signals 
that  it  is  time  to  proceed  to  the  second-order  phase  of  RSM. 

When  a  quadratic  functional  form  is  desired  or  indicated  by  the  hypothesis  test  on 
a  first-order  experiment  with  center  points,  we  must  add  more  design  points.  The  most 
common  second-order  design  is  called  the  central  composite  design  (CCD),  which  is 
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constructed  by  adding  axial  points  and  an  appropriate  number  of  center  points  to  a 
standard  two-level  factorial  design.  The  axial  points  are  generally  placed  at  a  distance 

a  =  from  the  center,  where  n/  is  the  number  of  design  points  in  the  factorial  core; 
since  this  ensures  that  the  variance  in  a  prediction  made  from  the  estimated  response 
surface  depends  only  upon  the  distance  of  that  prediction  point  from  the  design  center  (a 
property  called  rotatability).  Rotatability  is  generally  more  valuable  than  orthogonality  at 
this  stage  of  RSM.  Thus,  in  cases  where  it  is  not  possible  to  build  a  design  which  is  both 
rotatable  and  second-order  orthogonal,  we  prefer  a  design  which  is  rotatable  and  “near” 
orthogonal. 

The  second-order  design  is  used  to  estimate  the  parameters  of  a  second-order 
polynomial  model  such  as  +  +  +  As  in  the  first- 

order  phase  of  RSM,  statistical  tests  are  used  to  determine  if  any  of  the  terms  in  the  model 
can  be  dropped  to  achieve  a  simpler  form.  This  second-order  model  can  be  viewed  as  a 
second-order  Taylor  polynomial  approximation  for  the  true  underlying  function  and  used 
to  make  predictions  and  provide  insights  about  the  system. 

Once  we  get  past  the  basics  of  the  first-  and  second-order  phases  of  RSM,  we  see 
that  the  details  of  these  two  phases  have  also  been  rich  mines  for  research.  In  the  realm  of 
first-order  experimental  design,  there  has  been  a  wealth  of  study  on  ways  to  reduce  the 
tremendous  number  of  experimental  runs  required  by  a  factorial  experiment.  If  a  large 
number  of  factors,  k,  are  potentially  important,  we  can  no  longer  afford  the  2*'  runs 
required  for  a  two-level  factorial  experiment. 
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One  approach  to  the  problem  is  the  use  of  fractional  factorials  (2*"'^  designs),  which 
require  only  a  1/2^  fraction  of  the  design  points  specified  in  a  full  factorial  experiment  on  k 
factors.  In  such  cases,  the  design  columns  representing  levels  of  individual  factors  will  no 
longer  be  orthogonal  to  design  columns  representing  their  interactions,  thus  confounding 
their  effects.  If  two  factors  are  confounded,  then  we  are  not  able  to  separate  the  effects 
on  the  response  contributed  by  these  two  factors.  However,  with  careful  selection  of 
confounding  patterns,  fractional  factorial  designs  can  provide  a  very  efficient  approach  to 
screening  for  main  effects  and  possibly  for  low-level  interactions  as  well. 
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Fig.  B.4.  The  2v‘*'^  design. 

Figure  B.4  shows  the  design.  The  Roman  numeral  given  as  a  subscript  is  the 

resolution  of  the  design.  Design  resolution  indicates  the  level  of  confounding  present  in 

the  design.  For  example,  in  a  resolution  HI  design,  main  effects  are  confounded  with  two- 

factor  and  higher-order  interactions  but  not  with  other  main  effects.  In  a  resolution  IV 

design,  main  effects  are  not  confounded  with  other  main  effects  or  with  two-factor 

interactions,  but  two-factor  interactions  are  confounded  with  each  other  and  all  higher- 

order  interactions  are  confounded.  Resolution  V,  as  shown  in  Figure  B.4,  provides  main 

effect  estimation  which  is  not  confounded  with  any  interaction  lower  in  degree  than  a 
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four-factor  interaction.  Also,  two-factor  interactions  and  three-factor  interactions  are 
confounded  with  each  other. 

A  related  group  of  first-order  experimental  designs  are  the  Plackett-Burman 
designs.  Plackett-Burman  designs  allow  k  factors  to  be  investigated  in  as  few  as  k  +  1 
experimental  runs,  if  npB  =  k  -i- 1  is  a  multiple  of  four.  Otherwise,  we  would  require  upb 
runs,  where  npB  is  the  smallest  multiple  of  four  greater  than  k.  If  npB  runs  are  affordable, 
we  should  use  a  Plackett-Burman  screening  design,  due  to  the  acceptable  confounding 
patterns  they  exhibit.  However,  sometimes  we  cannot  afford  even  k  experimental  runs  to 
investigate  k  factors. 

Supersaturated  experimental  designs  are  those  which  investigate  k  factors  in  fewer 
than  k  runs.  Research  in  this  area  has  focused  almost  exclusively  on  three  types  of 
supersaturated  designs  and  their  application  to  sensitivity  analysis  of  large  computer 
simulation  models.  These  three  types  are  random  balance  designs,  systematic 
supersaturated  designs,  and  group- screening  designs.  (Kleijnen,  1987a;  Kleijnen,  1987b; 
Kleijnen,  1975;  Mamo,  1986;  Mauro,  1983;  Mauro  and  Burns,  1984;  D.  E.  Smith  and 
Mauro,  1982) 

Several  statisticians  have  reviewed  supersaturated  designs  and  noted  the  pros  and 
cons  of  each  type.  C.  A.  Mauro  and  J.  P.  C.  Kleijnen  have  been  prohfic  in  this  area  of 
research.  Their  works  provide  the  information  in  the  following  paragraphs. 

Although  the  definition  of  random  balance  designs  first  proposed  by  Satterthwaite 
in  1959  is  very  broad,  “random  balance”  is  now  used  to  denote  a  very  specific  type  of 
coded  design  matrix.  (Satterthwaite,  1959:112-113)  These  random  balance  designs  are 
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built  by  specifying  an  even  number  of  runs,  N,  for  the  experiment  and  the  number  of 
factors  to  be  investigated,  k.  The  design  matrix  D  is  then  Nx(k+1).  The  -1  and  +1  entries 
of  D  are  assigned  hy  random  sampling  without  replacement  so  that  each  column  contains 
N/2  negative  one  entries  and  N/2  positive  one  entries.  (Satterthwaite,  1959:113) 
Advantages  to  this  approach  are  that  N  is  controlled  and  the  design  is  simple  to  constmct 
for  any  even  value  of  N.  (D.  E.  Smith  and  Mauro,  1982:5 1;  Mauro,  1986:4-5)  There  are 
two  significant  disadvantages  to  random  balance  designs,  however.  First,  D  cannot  be 
orthogonal  for  N  <  k-i-1,  and  for  random  balance  designs  the  confounding  pattern  is 
random.  (D.  E.  Smith  and  Mauro,  1982:51;  Mauro,  1986:  5)  Second,  since  D  has  fewer 
rows  than  columns,  the  estimators  of  pi  are  biased.  (Kleijnen,  1987a:2889;  Kdeijnen, 
1987b:32;  Mauro,  1986:7)  Another  source  of  concern  is  the  lack  of  specialized  analysis 
techniques  for  the  results  of  a  random  balance  design.  Generally,  factors  are  deemed 
significant  if  their  parameters  pass  an  individual  F-test.  (D.  E.  Smith  and  Mauro,  1982:51; 
Mauro,  1986:5) 

Systematic  supersaturated  designs  are  the  results  of  attempts  to  remedy  the 
problems  of  random  balance  designs.  They  were  developed  by  Booth  and  Cox  in  1962. 
They  build  D  so  that  it  is  as  nearly  orthogonal  as  possible.  This  is  achieved  by  minimizing 
the  largest  pair-wise  dot  product  of  columns  of  D.  Booth  and  Cox  provided  a  computer 
algorithm  which  would  construct  the  systematic  supersaturated  design  for  combinations  of 
N  and  k  with  N  <  k  <  36.  However,  for  large  k,  it  may  not  be  practical  to  run  the 
computer  algorithm,  and  for  2N  <  k,  systematic  supersaturated  designs  may  not  he  much 
better  than  random  balance.  (Booth  and  Cox,  1962;  D.E.  Smith  and  Mauro,  1982:51) 
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Group-screening  is  an  entirely  different  approach.  In  group- screening 
experiments,  the  k  factors  of  interest  are  subdivided  into  g  groups  of  factors,  Gi,  G2, . . ., 
Gg.  In  the  first  stage  of  experimentation,  the  factors  within  a  group  are  treated  as  if  they 
were  a  single  factor,  so  our  design  matrix,  D,  contains  only  g  -1- 1  columns.  Generally,  a 
resolution  IV  fractional  factorial  or  Plackett-Burman  design  is  selected  for  the  grouped 
design.  (Kleijnen,  1975:492)  For  each  experimental  run  with  a  -1  in  the  Gi  column,  all 
factors  in  group  Gi  are  set  to  their  low  level.  Similarly,  where  a  +1  appears  under  Gi,  all 
factors  in  group  Gi  are  set  to  their  high  level.  The  experimental  results  are  analyzed  in  the 
standard  fashion  for  the  fractional  factorial  or  Plackett-Burman  design  selected,  and 
factors  are  dropped  from  consideration  if  their  group  is  found  to  have  a  statistically 
insignificant  effect.  The  process  repeats  itself  with  smaller  groups  until  all  factors  have 
been  ruled  out  or  tested  individually.  (Kleijnen,  1987a;  Kleijnen,  1987b;  Kleijnen,  1975; 
Mauro,  1986;  Mauro,  1983;  D.  E.  Smith  and  Mauro,  1982) 

At  any  stage  in  group- screening,  the  factors  within  a  group  are  completely 
confounded  with  each  other.  Thus,  it  is  important  that  factors  with  opposite  effects,  e.g. 
+A  and  -A,  be  placed  in  different  groups.  Otherwise,  significant  factors  may  be  dropped 
from  consideration  due  to  cancellation  within  their  group.  However,  if  factors  with 
significant  interactions  are  placed  within  the  same  group,  and  if  a  resolution  IV  design  is 
used,  there  should  be  no  confounding  of  factor  main  effects  between  groups.  (Kleijnen, 
1975:490) 

The  advantages  of  group-screening  are  the  control  exerted  over  confounding  and 
the  ease  of  application  and  analysis.  The  disadvantages  are  the  requirement  for  a  priori 
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knowledge  about  the  effects  of  factors  and  the  lack  of  control  over  the  total  number  of 
runs  required.  (Kleijnen,  1987a;  Kleijnen,  1987b;  Kleijnen,  1975;  Mauro,  1986;  Mauro, 
1983;  D.  E.  Smith  and  Mauro,  1982) 

Most  practitioners  seem  to  favor  the  group-screening  approach  over  random 
balance  and  systematic  supersaturated  methods.  In  many  practical  instances,  it  seems  that 
the  prior  knowledge  required  by  group-screening  is  available.  Moreover,  the  number  of 
runs  may  be  further  reduced  by  placing  factors  which  we  expect  to  be  significant  in  small 
groups  and  factors  expected  to  be  insignificant  in  large  groups. 

Once  the  factor-screening  is  completed  in  the  first-order  stage  of  RSM,  a  second- 
order  experiment  is  employed  and  a  second-order  model  is  fitted  to  the  data.  Thus,  at  this 
stage  in  the  process  we  become  more  concerned  with  bias  error  due  to  model 
misspecification.  The  concern  with  bias  is  particularly  important  in  the  case  of  a 
deterministic  experiment  such  as  a  mathematical  program,  where  experimental  error  does 
not  exist. 

B.K.  Ishihara  compares  bias  minimizing  and  variance  minimizing  experimental 
designs  used  with  deterministic  models  as  the  experiment.  (Ishihara,  1985)  Central 
composite  designs  and  other  second-order  rotatable  designs  often  employed  in  RSM  are 
used  as  the  variance  minimizing  designs.  The  bias  minimizing  designs  used  are  more 
obscure,  since  practitioners  traditionally  concentrate  more  on  the  variance.  Both  types  of 
experimental  design  are  applied  to  the  Arsenal  Exchange  Model,  a  linear  programming 
model  to  which  RSM  had  been  previously  applied  at  the  Air  Force  Institute  of 
Technology.  (Ishihara,  1985:1-7  to  1-8)  Ishihara  defines  effectiveness  as  the  sum  of 
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squared  deviations  between  the  response  surface  prediction  and  the  actual  value  for  a 
random  selection  of  points,  with  efficiency  measured  by  the  number  of  design  points 
required.  (Ishihara,  1985:1-8  to  1-9) 

Ishihara  found  no  significant  differences  between  the  variance  minimizing  and  bias 
minimizing  designs  with  respect  to  the  predictive  ability  and  the  explanatory  power  they 
provide.  He  notes  that  the  effectiveness  and  efficiency  goals  are  competing  objectives  in 
the  designs  studied.  However,  he  found  that  the  design  properties  of  orthogonality  and 
rotatabiUty  are  both  important  in  this  area.  Since  many  widely  used  designs  provide 
“near”  orthogonality  or  “near”  rotatability,  he  recommends  further  research  to  delineate 
these  gray  areas  and  outline  levels  of  “near”  orthogonality  and  “near”  rotatability  which 
provide  adequate  results.  (Ishihara,  1985:4-2  to  4-4) 


B-30 


APPENDIX  C:  THE  SAILS™  SYSTEM 


C.l.  OVERVIEW  OF  SAILS™ 

SAILS™  is  the  supply  chain  optimization  software  system  produced  hy  INSIGHT, 
Inc.  A  standard  in  the  industry,  this  is  the  system  used  at  PFS  LC.  It  includes  user 
interfaces  for  model  building,  run  launching,  solution  review,  and  presentation  along  with 
two  underlying  mixed  integer  linear  programming  solver  engines  (ODS  and  OPTIMA). 
The  interfaces  for  data  input  and  model  building  are  pop-up  menus  and  spreadsheet 
screens  which  generate  a  collection  of  formatted  ASCII  files  with  various  extensions  for 
use  by  the  solvers.  The  input  files  can  be  modified  through  the  user  interface  or  directly  as 
long  as  their  format  is  retained.  PFS  LC  analysts  use  the  interfaces  to  build  base  models 
for  each  project.  However,  when  minor  modifications  are  made,  experienced  analysts  find 
it  more  expeditious  to  change  the  ASCII  files  directly. 

Once  a  model  is  launched  and  the  solver  successfully  completes  its  iterations,  a 
second  collection  of  formatted  ASCII  files  are  generated  as  solution  reports.  These 
reports  can  be  viewed  directly  or  displayed  graphically  by  SAILS™  solution  display 
interfaces.  The  set  format  of  the  output  reports  allow  experienced  analysts  to  quickly  find 
the  data  elements  they  are  most  interested  in.  The  format  also  allows  creation  of 
programs  to  find  certain  data  in  repeated  applications. 

C.2.  SAILS™  INPUT  DATA  FILES 

The  input  data  files  have  the  following  extensions:  _.cdf,  _.ebf,  _.ecf,  _.faf,  _.mcf, 
_.run,  _.sdf,  _.sgf,  and  _.tdf.  The  files  contain  information  required  for  the  SAILS™ 
solvers  to  successful  execute,  as  detailed  in  Table  C.l.  below.  (Deviprasad,  1996) 
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Table  C.l.  SAILS™  input  files 


File 

Type  Extension 

Header 

Designation  Description 

System  _.sdf 

Definition 

File 

MATERIAL  List  of  Raw  Materials  Group 

PRODUCTS  List  of  Finished  Products  Group 

SUPPLIER  List  of  Raw  Materials  Supplier  Locations 

PRODUCER  List  of  Plants/Vendor  Locations 

WAREHOUS  List  of  Distribution  Center  Locations 

CUSTOMER  List  of  Customer  Region  Locations 

CUSTCLAS  Customer  Classes 

COMPONEN  List  of  Component  Products 

BNDLNAME  Product  Bundle  Names 

TIMENAME  Time  Period  Definitions 

ECHELONS  Network  Echelons 

MATLCODE  Raw  Material  Recoding  Data  (for  use  with  Transactions  file) 

PRODCODE  Finished  Product  Recoding  Data  (for  use  with  Transactions  file) 

SUPPCODE  Raw  Material  Supplier  Recoding  Data  (for  use  with  Transactions  file) 

PLNTCODE  Plant  Location  Recoding  Data  (for  use  with  Transactions  file) 

WHSECODE  Distribution  Center  Recoding  Data  (for  use  with  Transactions  file) 

TERRCODE  Customer  Region  Recoding  Data  (for  use  with  Transactions  file) 

REGNCODE  Forecast  Database  Cross  Reference  (for  use  with  Transactions  file) 

CLASCODE  Customer  Class  Recoding  Data;  Customer  Class  Codes  (for  use  with  Transactions  file) 

CUSTCODE  Customer  Class  Recoding  Data;  Customer  Account  Numbers  (for  use  with  Transactions  file) 

Facility  _.faf 

Data  File 

SU  PPCOST  Raw  Material  Supplier  Costs  and  Capacities 

PLNTCPTY  Plant  Location  Fixed  Costs  and  Capacities 

PLNFCTR  Plant  Location  -  Finished  Product  Capacity  Conversion  Factors 

COMPCOST  Production  Cost  and  Capacities;  Component  Products 

COMPLINE  Joint  Component  Production  Line  Capacity  Constraints 

PLNTCOST  Production  Cost  and  Capacities;  Finished  Products 

PLNTLINE  Joint  Finished  Product  Production  Line  Capacity  Constraints 

RMCPCNVT  Raw  Materials  -->  Component  Products  Conversion/Blending  Data 

CPFPCNVT  Component  Products  ->  Finished  Products  Conversion/Blending  Data 

RMFPCNVT  Raw  Materials  -->  Finished  Products  Conversion/Blending  Data 

PRODSTND  Finished  Product  Specific  Data 

WHSECPTY  Distribution  Center  Costs  and  Capacities 

WHSECOST  Distribution  Center  Product-Specific  Variable  Costs 

INBMILES  Inbound  Link  Distances 

REPMILES  Replenishment  Link  Distances 

INTMILES  Transfer  Link  Distances 

OUTMILES  Outbound  Link  Distances 

MATLBLND  Raw  Material  Viscosity  Indices 

COMPBLND  Comoonent  Product  Viscosity  Indices 

Transport  _.tdf 

Data  File 

IBWTDRAT  Inbound  Rates  from  Suppliers  to  Plants 

IPWTDRAT  Rates  for  Interplant  (Intraplant)  Transfer  of  Componenf  Products 

RPWTDRAT  Replenishment  Rates  from  Plants  to  Warehouses 

TRWTDRAT  Transfer  Rates  between  Warehouses 

OBWTDRAT  Outbound  Rates  from  Warehouses  to  Customer  Zones 

Demand  _.cdf 

Data  File 

CUSTDMND  Demand  by  Customer  Zone  and  Product  Group 

TRANSACT  Transactions  File 

Scenario  _.sgf 

Generation 

File 

1 

SUPPELIQ  Raw  Materials  Supplier  Eligibility 

PLNTELIG  Plant  Location  Finished  Product  Production  Eligibility 

PRODELIG  Finished  Product  Production  Eligibility 

REPLINKS  Replenishment  Link  Selections 

INTLINKS  Transfer  Link  Selections 

BUNDLING  Bundling  Strategy  Definitions 

WHSELIG  Distribution  Center  Stockage  Eligibility 

STCKELIG  Finished  Product  Stockage  Eligibility 

CLASELIG  Distribution  Center  -  Customer  Class  -  Shipment  Type  Eligibility 

OUTLINKS  Outbound  Link  Selections 

PLNTSVCE  Plant  Location  (Replenishment)  Service  Limits 

DISTSVCE  Distribution  Center  (Transfer)  Service  Limits 

WHSESVCE  Customer  Service  Limits  -  Distribution  Centers 

CLASSCAL  Customer  Class  Scale  Factors 

RDMDSCAL  Composite  Demand  Scale  Factors;  Customer  Regions 

RPSELECT  Repienishment  Weighted  Rate  Category  Assignments  &  Selection  Options 

.ecf 

Faciiitv  Status.  Reports  Requested 

.ebf 

Binary  Version  of  ,ecf  File 

.run 

Runtime  Parameters 
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C.3.  SAILS™  OUTPUT  REPORT  FILES 


The  output  data  files  have  the  extensions:  _.log,  _.min,  _.rpt,  _.tbl.  The  _.log  file 
contains  runtime  information  and  error  codes.  The  _.rpt  file  is  the  full  solver  report  on  the 
solution  obtained.  The  _.min  file  is  a  summary  of  the  full  solver  report.  The  _.tbl  is  the 
final  solver  table,  and  can  be  deleted  after  each  run.  (Deviprasad,  1996) 
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APPENDIX  D:  CIAT  USERS’  GUIDE 


D.l.  STRUCTURE 

This  appendix  begins  with  a  flow  chart  of  the  CIAT  apphcation  process.  The  flow 
chart  is  annotated  with  time  estimates  and  each  component  is  numbered  for  reference 
purposes.  The  flow  chart  is  followed  by  a  section  detailing  each  numbered  component. 
The  flow  chart  and  accompanying  details  are  designed  to  provide  a  practical  users’  guide 
to  CIAT. 
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D.2.  CIAT  PROCESS  FLOW  CHART 
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D.3.  PROCESS  DETAILS 


[1]  SAILS  base  model.  A  SAILS™  base  model  includes  all  the  required  input  files  with 
data  which  will  run  to  successful  solution  when  submitted  to  one  of  the  SAILS™ 
solver  engines. 

[2]  Extract  costs  from  SAILS  input  files  into  Excel  “Work  Space” .  The  Pascal  routine  in 
stripfaf.exe  is  designed  to  extract  both  fixed  and  variable  costs  on  the  model’s 
distribution  centers  and  place  them  in  a  text  file  named  fafout.txt.  This  file  is  opened 
in  Excel  as  a  tab  delimited  file  and  moved  into  the  Excel  workbook  containing  the 
“Work  Space”  for  this  CIAT  application.  The  Excel  macro  “faf2wkspace”  copies  aU 
nonempty  rows  in  the  “fafout”  sheet  and  inserts  them  properly  into  the  “Work  Space” 
sheet. 

[3]  Set  upper  and  lower  bounds  on  costs  to  be  investigated.  Columns  are  specified  in  the 
“Work  Space”  sheet  for  the  initial  cost  value  (extracted  from  “fafout”)  and  upper  and 
lower  bounds.  Bounds  should  only  be  added  for  costs  which  will  be  investigated  in 
the  CIAT  application.  The  analyst  may  type  in  upper  and  lower  bounds  directly  or  use 
Excel  formulae  to  specify  a  percentage  increase  and  decrease  of  the  initial  cost  value. 
If  cost  bounds  are  individually  set,  the  value  in  the  initial  cost  column  must  be  revised 
so  that  it  gives  the  midpoint  of  the  range. 

[4]  Group  costs;  use  groups  as  factors.  As  upper  bounds  are  specified,  the  number  of 
cost  factors  for  the  CIAT  application  is  tallied  in  a  cell  on  row  two  of  the  “Work 
Space”.  If  this  number  of  costs  is  greater  than  31,  groups  must  be  specified  in  the 
given  column.  Groups  should  be  designated  by  whole  numbers,  beginning  with  group 
1.  Costs  not  included  in  the  CIAT  application  should  be  denoted  by  a  zero  in  the 
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group  column.  The  second  row  of  the  “Work  Space”  also  contains  a  cell  which  holds 
the  number  of  groups  specified  (the  maximum  value  contained  in  the  group  column). 

[5]  Generate  design  table  in  JMP  for  factors.  Use  the  number  of  groups  or  number  of 
costs  from  the  top  of  the  “Work  Space”  to  generate  the  appropriate  design  in  JMP. 

To  get  JMP  to  generate  a  coded  experimental  design  table,  go  through  “Table  -> 
Design  Experiment”.  Then  select  two-level  design  from  the  pop-up  menu.  Once  the 
design  is  selected  from  the  pop-up  menu’s  listing,  click  “Generate  Design”.  At  this 
time  it  is  a  good  idea  to  save  the  design  table  to  a  permanent  file. 

[6]  Bring  coded  design  table  into  “Work  Space”  &  calculate  true  settings  for  design 
runs.  Run  the  “Export2Dsign”  macro  to  create  a  properly  formatted  text  copy  of  the 
“Work  Space”  sheet  with  the  name  “DsignBkgmd.txt”.  Bring  a  text  copy  of 
“DsignBkgmd.txt”  into  JMP  along  with  its  labels.  Subset  the  design  table  so  that  all 
numeric  columns  are  selected  and  transpose  this  subset  table  so  that  the  factor  labels, 
e.g.  “xl”,  are  at  the  beginning  of  each  row.  Change  these  labels  so  that  xl  becomes  1, 
x2  becomes  2  and  so  on.  Then  merge  the  subset  table  onto  the  text  copy  of  the 
“DsignBkgmd.txt”  by  matching  the  group  column  with  the  labels  on  the  design  table 
subset.  This  requires  use  of  the  “Table  ->  Join”  operation.  Save  the 
“DsignBkgmd.txt”  text  file  which  now  includes  the  coded  factor  settings  for  each  mn 
from  the  design  table — make  sure  to  specify  the  text  file  option  with  column  labels. 
Open  this  new  text  file  in  Excel  and  move  it  into  the  workbook  used  for  this  CIAT 
application.  This  can  be  done  by  running  the  “ImportFromDsign”  macro.  To 
calculate  the  real  cost  settings  for  the  first  design  run,  use  an  Excel  formula  which 
places  the  lower  bound  value  in  the  appropriate  cell  of  the  “Reall”  column  if  the 
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corresponding  value  in  column  “Rowl”  is  less  than  zero,  the  upper  bound  value  if 
greater  than  zero,  and  the  initial  value  otherwise.  Use  fixed  cell  references  in  the 
formula  where  you  reference  the  upper  bound  column,  the  lower  bound  column,  and 
the  initial  value  column  (this  is  done  in  current  versions  of  Excel  by  placing  a  “$” 
before  the  column  reference).  Then  the  fill  commands  can  be  used  to  place  this 
formula  in  columns  for  each  run  and  rows  for  each  factor. 

[7]  Create  SAILS  files  for  runs  in  design  table.  Run  the  “Outputdoe”  macro  to  create  a 
file  “doe.txt”  which  contains  the  real  cost  setting  revisions  for  each  run  in  the  design 
table  and  is  placed  in  the  “SAILS/modelbldr/models”  subdirectory.  Run  the  Pascal 
routines  in  “chgfaf.exe”  and  “filecopy.exe”,  specifying  the  root  file  name  for  the 
SAILS™  base  model  and  the  number  of  copies  (number  of  runs  in  the  design  table). 

[8]  Load  &  launch  SAILS  runs.  Select  the  _.ecf  files  for  each  run  in  the  design  table  and 
load  them  on  the  run  stack  in  the  SAILS™  run  launcher  interface.  Then  launch  the 
entire  collection. 

[9]  Extract  response  values  from  SAILS  _.min  files  &  add  to  coded  table  in  JMP.  Run 
the  Pascal  routine  in  “stripmin.exe”  if  you  are  only  interested  in  collecting  the  optimal 
system  cost  value.  Specify  the  root  file  name  and  number  of  copies  as  before.  This 
program  extracts  the  total  system  cost  from  each  of  the  specified  runs  and  places  them 
in  the  file  “minout.txt”  labeled  and  ordered  by  run  number.  If  you  also  want  to  collect 
a  list  of  DCs  with  positive  throughput  in  the  solution,  run  “stripm2.exe”  instead.  Open 
the  “minout.txt”  file  in  JMP  as  a  tab  delimited  text  file  with  no  labels  or  headings. 
Merge  this  file  with  the  design  table,  using  the  “Table  ->  Join”  operation.  If  the  row 
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order  has  been  retained  as  the  run  order,  the  tables  can  be  merged  by  row  number. 


Otherwise,  use  the  option  to  match  the  columns  containing  the  run  numbers. 

[lQi\Analyze  model  wllinear  least  squares  regression.  Analyze  the  model  in  JMP.  Select 
the  new  cost  column  as  the  Y  variable  and  the  factor  columns  as  the  X  variables  in  the 
model.  If  the  design  table  is  second-order,  use  the  “Effects”  drop  down  menu  to 
specify  a  response  surface  model.  Determine  which  factors  are  drivers  according  to 
their  relative  contributions  to  the  model  sum  of  squares.  Evaluate  the  presence  of 
curvature  by  looking  at  the  cost  error  of  prediction  at  the  center  point  or  by  including 
an  appropriate  number  of  center  point  rows  in  the  model  and  considering  the  value. 

{\\\Drop  groups  which  have  relatively  small  sum  of  squares.  Although  determining 
which  cost  groups  to  drop  is  a  subjective  judgment,  it  will  not  be  unusual  to  see  eighty 
to  ninety  percent  of  the  sum  of  squares  contributed  by  a  small  collection  of  the  groups. 

[12] Break  remaining  groups  into  smaller  groups  or  individual  costs  for  use  as  factors.  It 
is  possible  to  break  into  individual  costs  with  as  many  as  thirty-one  cost  factors,  with  a 
thirty-two  run  Plackett-Burman  experimental  design.  However,  it  may  require  fewer 
total  runs  if  another  grouped  stage  is  performed  before  breaking  into  individual  costs. 
This  decision  is  also  a  matter  for  subjective  analyst  choice. 

[13]  Generate  second-order  design  table  in  JMP.  This  component  is  similar  to  the  steps  in 
block  [5].  The  first  difference  is  that  the  “Response  Surface”  option  is  selected  from 
the  pop-up  menu  rather  than  the  “Two-Level  Design”  option  used  in  [5].  Axial  points 
should  be  specified  as  “face-centered”  so  that  the  only  cost  settings  required  are  the 
upper  and  lower  bounds  and  midpoint. 
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[\A]Use  rows  which  have  not  been  run  as  new  design  table.  This  requires  a  subsetting 
table  operation.  If  the  tables  are  small  enough,  the  rows  may  be  manually  selected 
before  subsetting  the  table.  If  the  tables  are  very  large,  it  may  be  preferable  to  join  the 
tables  by  the  pattern  column  first,  keeping  non-matched  rows  from  both  tables.  Then 
create  a  column  in  this  combined  table  with  a  formula  to  contain  a  one  for  rows  with 
the  same  values  in  both  pattern  columns  and  a  zero  otherwise.  Then  the  appropriate 
rows  can  be  selected  for  the  subset  table  by  using  the  “Rows  ->  Select  ->  Where. . .” 
operation. 

Finally,  if  desired,  the  response  smface  can  be  displayed  in  a  three-dimensional 
spinning  plot  platform  in  IMP.  In  order  to  generate  the  appropriate  x-y  grid  points,  use 
the  “Grdtmxy.jmp”  file  provided  in  the  “Templats”  folder.  Enter  -1  and  -i-l  in  the  spaces 
provided  for  the  start  grid  and  stop  grid  values,  respectively.  Next,  add  rows  to  the  table 
in  a  proportion  which  matches  the  density  of  spacing  you  want  in  your  graph — so  if  you 
want  fifty  points  sampled  over  the  2x2  grid  specified,  add  fifty  rows.  Return  to  the  output 
page  which  contains  the  response  surface  analysis  results  (obtained  as  a  result  of  your  “Fit 
Model”  operation).  Save  the  prediction  formula  to  the  underlying  data  table  using  the 
option  given  in  the  “$”  menu  in  the  lower  left  comer  of  the  output  page.  View  this 
prediction  formula  by  selecting  the  column  heading,  using  the  “Cols  ->  Column  Info” 
operation,  and  double  clicking  in  the  lower  left  comer  where  a  small  preview  of  the 
formula  is  displayed.  To  select  the  entire  formula,  click  on  all  portions  of  the  formula 
while  holding  the  shift  key.  Copy  the  formula  using  the  “Edit”  menu  option.  Now  return 
to  the  grid  table  and  paste  this  formula  into  the  column  given  for  the  response  values, 
using  the  same  steps  required  to  copy  the  formula.  Finally,  select  the  “Spinning  Plot” 
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option  in  the  “Graph”  menu  and  specify  the  x,  y,  and  response  columns  to  be  plotted.  The 
buttons  provided  to  the  left  of  the  plot  allow  you  to  change  your  viewing  perspective  in 
many  ways.  Once  you  obtain  a  view  which  you  would  like  to  incorporate  into  a 
presentation  in  some  other  media,  use  the  “Edit  ->  Journal”  option  to  send  the  window 
contents  to  a  journal  page  which  can  be  saved  in  a  rich  text  format. 
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APPENDIX  E:  EXCEL  USER  INTERFACE  AND  MACROS 


E.I.  USER  INTERFACE 


The  CIAT  user  interface  is  a  specially  structured  Excel  workbook.  The  workbook 
contains  a  “Work  Space”  sheet  which  is  the  basis  of  each  stage  in  a  CIAT  apphcation. 

The  initial  format  of  the  “Work  Space”  sheet  is  shown  in  Figure  E.l. 


Number  of  factors 

0 

Number  of  Groups 

1  o| 

RIe 

SAILS  ID 

Cost  Type 

Initial  Cost 

Lower  Bound 

BOTTOM  OF  TABLE 

Fig.  E.l.  Excel  “Work  Space”  format  at  start  of  CIAT  application. 


Once  the  costs  are  extracted  from  the  base  SAILS model,  the  text  file  containing 
this  data  is  added  as  another  sheet  in  the  workbook,  and  the  rows  containing  the  cost  data 
are  copied  and  inserted  into  the  table  by  one  of  the  following  Excel  macros.  Next,  the 
analyst  inserts  the  appropriate  upper  and  lower  bounds  and  adjusts  the  initial  cost  to  their 
midpoint  if  necessary.  The  number  of  costs  with  bounds  specified  is  shown  as  the 
“Number  of  factors”.  If  grouping  is  required,  the  analyst  numbers  the  costs  according  to 
their  group  membership.  Any  costs  not  included  are  designated  with  a  zero  in  the  group 
column. 

Once  the  number  of  cost  factors  or  group  factors  is  used  to  generate  the 
appropriate  experimental  design  table  in  JMP,  the  coded  factor  settings  are  merged  to  a 
text  copy  of  this  table  so  that  the  -1, 0,  and  +1  settings  appear  on  the  appropriate  rows 
(i.e.  matched  according  to  group  or  individual  cost  factor).  This  extended  table  is  also 
opened  and  added  to  the  workbook.  In  this  sheet,  the  true  cost  settings  are  calculated  for 
each  cost  factor  and  all  runs  in  the  design,  by  using  the  lower  bound  when  a  -1  appears  in 
the  coded  column,  the  initial  cost  when  a  0  appears,  and  the  upper  bound  when  a  +1 
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appears.  Finally,  the  text  file  “doe.txt”  is  created  from  this  worksheet — for  use  by 


“chgfaf.exe”  to  create  the  modified  _.faf  files  for  input  to  SAILS^^.  Copies  of  the  “Work 
Space”  sheet  and  the  sheet  used  as  “doe.txt”  can  be  saved  in  the  workbook  with  a  new 
name  so  that  the  analyst  has  a  record  of  the  stages  completed  in  this  CIAT  application. 
The  following  sections  contain  copies  of  each  of  the  macros  included  in  the  workbook  to 
automate  several  of  the  required  spreadsheet  operations. 

E.2.  EXCEL  MACROS 
E.2.1.1,  Role  of  the  ‘*Importfaf’  Macro. 

This  macro  opens  the  text  file  “FAFOUT.TXT”,  generated  by  the  “stripfaf.exe” 
program,  and  moves  it  into  our  workbook  as  an  additional  sheet.  Note  that  the  workbook 
name  must  be  inserted  in  place  of  “PFSWORK.xls”  if  the  analyst  wishes  to  use  this  macro. 
E.2.1.2.  ‘^Importfaf’  Code. 

I 

'  Importfaf  Macro 

'  Macro  recorded  1/6/97  by  Angela  Giddings 

f 

» 

Sub  importfafQ 

ChDir  "C:\SAILSNmodlbldrNmodels" 

Workbooks.OpenText  Filename  :="C:\S  AILS\modlbldr\models\FAFOUT.TXT",  _ 
Origin:=xlWindows,  StartRow:=l,  DataType:=xlFixedWidth,  _ 

FieldInfo:=  Array (Array(0, 1),  Array(3, 1),  Array(15, 1),  Array(23, 1)) 
Sheets("FAFOUT").MoveBefore:=Workbooks("PFSWORK.xls'’).Sheets(l) 

End  Sub 

E.2.2.1.  Role  of  the  ‘faf2Wkspace”  Macro. 

This  macro  inserts  the  cost  data  rows  from  the  “FAFOUT”  sheet  to  the  “Work 
Space”  sheet,  immediately  above  the  “BOTTOM  OF  TABLE”  cell. 

E.2.2.2.  “faf2Wkspacer  Code. 


E-2 


faf2Wkspace  Macro 

Macro  recorded  1/6/97  by  Angela  Giddings 


Sub  faf2wkspace() 

SheetsC'Work  Space").Select 
Range("A4").Select 

Sheets("FAFOUT").Select 
Range("A2").Select 
While  ActiveCell  <> "" 

ActiveCell.RowsC  1 : 1  ").EntireRow.Select 
Selection.Copy 
SheetsC'Work  Space").Select 
ActiveWindow.ScrollRow  =  1 
ActiveCell.RowsC  1 : 1  ").EntireRow.Select 
Selection.Insert  Shift:=xlDown 
ActiveCell.OffsetC  1 , 0).Range("  A 1  ").Select 
Sheets("FAFOUT").Select 
ActiveCell.Offset(l,  0).Range("Al  ").Select 
Wend 

Application.CutCopyMode  =  False 
End  Sub 

E. 2.3.1.  Role  of  the  “ExportlDesign”  Macro. 

This  macro  names  the  current  sheet  (which  should  be  the  “Work  Space”  sheet  or  a 
copy  of  it )  “DesignBkgmd”,  removes  the  top  two  rows  and  the  first  row  beneath  the 
headings  (which  will  be  a  header  row  from  the  FAEOUT  sheet),  and  removes  all  rows 
with  a  “0”  value  in  the  group  column. 

E.2.3.2.  “Export2Design*’  Code. 


Export2Design  Macro 

Macro  recorded  1/6/97  by  Angela  Giddings 
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Sub  Export2Design() 


Sheets.Add 

ActiveSheet.Name  =  "DesignBkgmd" 

SheetsC'Work  Space").Select 
Application.Goto  Reference:="WorkArea" 

Selection.Copy 

Sheets("DesignBkgmd").Select 

Range("Al").Select 

Selection.PasteSpecial  Paste:=xlValues,  Operation:=xlNone, 
SkipBlanks:=False,  Transpose:=False 


Range("Al").Select 

Cells.Select 

Application.CutCopyMode  =  False 

Selection.Copy 

Range("Al"). Select 

Selection.PasteSpecial  Paste:=xlValues,  Operation:=xlNone,  _ 

SkipBlanks:=False,  Transpose:=False 

While  ActiveCell  <> "" 

ActiveCell.Offset(0, 6).Range("A  1  ").Select 
If  ActiveCell  =  "0"  Then 

ActiveCell.Rows("  1 : 1  ").EntireRow.Select 
Selection.Delete  Shift:=xlUp 
Else  ActiveCell.Offset(l,  -6).Range("Al").Select 
End  If 

Wend 

ActiveWorkbook.  Save 

ActiveWorkbook.SaveAs  Filename:="C:\JMPSDsignBkgrd.txt",  _ 

FileFomiat:=xlText,  CreateBackup:=False 

End  Sub 

E.2.4.1.  Role  of  the  “ImportFromDesign”  Macro. 

This  macro  opens  the  file  “DsignBkgrd.txt”  from  the  IMP  folder  where  it  was 
appended  with  the  coded  factor  levels.  Next,  the  macro  moves  the  text  file  into  the 
“PFSWORK.xls”  workbook  as  an  additional  sheet.  If  the  current  CIAT  project  workbook 
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has  a  different  name,  the  analyst  must  replace  “PFSWORK.xls”  with  this  name  in  the 
code. 

E.2.4.2.  “ImportF romDesign”  Code. 

I 

'  ImportFromDsign  Macro 
'  Macro  recorded  1/28/97  hy  Angela  Giddings 


Sub  ImportFromDsignO 

ChDir  "C:\1MP" 

Workbooks.OpenText  Filename:="C:\JMPvDsignBkgrd.TXT",  C)rigin:=  _ 
xlWindows,  StartRow:=l,  DataType:=xlDelimited,  TextQualifier  _ 
:=xlDoubleQuote,  ConsecutiveDelimiter:=False,  Tab:=True,  _ 
Semicolon:=False,  Comma:=False,  Space:=False,  Other:=False,  _ 
FieldInfo;=Array(Array(l,  1),  Array(2, 1),  Array(3, 1),  Array(4, 1),  Array  _ 

(5,  1),  Array(6, 1),  Array(7, 1),  Array(8, 1),  Array(9, 1),  Array(10, 1),  Array(  _ 
11, 1),  Array(12, 1),  Array(13, 1),  ArTay(14, 1),  Array(15, 1),  Array(16,  1),  _ 
Array(17, 1)) 

Sheets("DsignBkgrd").Move  Before:=Workbooks("PFSWORK.xls").Sheets(l) 
End  Sub 
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E.2.5.I.  Role  of  the  “Outputdoe”  Macro. 


This  macro  works  with  the  sheet  that  contains  the  coded  factor  settings  for  each 
cost  in  columns  labeled  “Rowl”,  “Row2”,  and  so  on.  This  worksheet  should  also  have 
the  calculated  true  cost  settings  in  columns  labeled  “Reall”,  “Real2”,  etc.  The  macro 
copies  all  cells  in  the  sheet  and  pastes  in  their  values  so  that  no  information  is  lost  as  some 
cells  are  deleted.  Next  the  macro  deletes  all  columns  in  the  sheet  except  for  the  columns 
labeled  “File”,  “SAILS  ID”,  and  “Cost  Type”  along  with  any  “Real”  cost  setting  columns. 
Finally,  this  sheet  is  saved  as  the  text  fde  “doe.txt”  in  the  subdirectory 
“C:\SAILS\modelbldiNmodels”  where  the  “chgfaf.exe”  program  uses  it  to  create  the 
modified  _.faf  files  for  each  run  in  the  design. 

E.2.5.2.  ‘‘Outputdoe’*  Code. 


Outputdoe  Macro 

Macro  recorded  1/6/97  by  Angela  Giddings 


Sub  OutputdoeO 
Range("Al").Select 
Cells.Select 

Application.CutCopyMode  =  False 

Selection.Copy 

Range("Al").Select 

Selection.PasteSpecial  Paste:=xlValues,  Operation:=xlNone, 
SkipBlanks:=False,  Transpose:=False 

While  ActiveCell  <>  "Reall" 

If  ActiveCell  <>  "Reall"  Then 
If  ActiveCell  <>  "File"  Then 
If  ActiveCell  <>  "SAILS  ID"  Then 
If  ActiveCell  <>  "Cost  Type"  Then 

ActiveCell.Columns("A:A").EntireColumn.Select 
Selection.Delete  Shift:  =xlToLeft 
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ActiveCell.Select 

ActiveCell.Offset(0,  -l).Range("Al  ").Select 

End  If 
End  If 
End  If 
End  If 

ActiveCell.Offset(0,  l).Range("Al  ").SeIect 
Wend 

ActiveSheet.Name  =  "doe" 

ActiveWorkbook.Save 
Rows("l:l").Select 
Selection.Delete  Shift:=xlUp 

Active  Workbook.SaveAs  Filename:="C:NSAILS\modlbldi\models\doe.txt", 
FileFomiat:=xlText,  GeateBackup:=FaIse 


End  Sub 


APPENDIX  F:  PASCAL  ROUTINES 


F.l.  STRIPFAF 

F.1.1.  Purpose  and  Implementation. 

The  stripfaf.exe  program  is  the  executable  file  created  once  the  following  Turbo 
Pascal  program  stripfaf.pas  is  compiled.  This  program  reads  the  _.faf  file  with  the  input 
root  file  name  found  in  its  resident  subdirectory,  and  formats  relevant  fields  (i.e.,  the 
SAILS'^M  id  number  for  each  distribution  center,  a  label  to  denote  the  cost  type  for  that 
line,  and  the  fixed  or  variable  cost  value  associated  with  that  SAILS'^^  id  number)  for 
output  to  a  text  file  which  wiU  be  used  in  the  Excel  “Work  Space”  (the  Cl  AT  user 
interface). 

The  executable  file  should  be  invoked  from  the  DOS  prompt  in  its  resident 

subdirectory  with  the  command  “stripfaf  <root  file  name>  <number  of  file  copies>”.  The 

second  parameter  is  actually  not  necessary,  but  it  is  included  for  consistency  with  the  other 

Turbo  Pascal  programs. 

F.1.2.  Stripfaf.pas  Code. 

Program  FAF; 

uses  CRT; 

const 

twosp  = ' 
fivesp  = ' 

vLabel  = '  variable 
fLabel  = '  fixedcst 

var 

FAFfile :  text; 

FAFout ;  text; 

S  :  string[80]; 
header :  string[10]; 
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fixed :  string[10]; 
variable :  string[10]; 
SAILSID  :  string[10]; 
filelD  :  slring[5]; 
oldName :  string[10]; 


begin  {program} 

ClrScr; 

if  ParamCount  <>  2  then 
begin 

Writeln('Input  root  file  name  and  number  of  copies.'); 

Halt; 

end; 

oldName:=Concat(ParamStr(l),  ’.faf); 

Assign(FAFfile,  oldName); 

Assign(FAFout,  'FAFout.txt'); 

Reset(FAFfile); 

Rewrite(FAFout); 

While  not  Eof(FAFfile)  do 
begin  (while  not  eof  FAFfile  1 } 

Readln(FAFfile,  S); 
if(S  =  'WHSECPTY')then 
begin  {if  S  =  'WHSECPTY'} 

While  not  Eof(FAFfile)  do 
begin  {while  not  eof  FAFfile  2} 
header  :=Copy(S,  1,10); 
fixed:=Copy(S,21,10); 
variable:=Copy(S  ,61,10); 

SAILSID:=Copy(S,77,4); 

FUeID:=TAF'; 
if  not(header='  ')  then 

Writeln(FAFout,  fileID,twosp,header); 
if  header='  '  then 
begin 

Writeln(FAFout,  fileID,twosp,SAILSID,fivesp,fLabel,fixed); 
Writeln(FAFout,  fileID,twosp,SAILSID,fivesp,vLabel, variable); 
end; 

Readln(FAFfile,S); 
end;  {while  not  eof  FAFfile  2} 

header; =Copy(S  ,1,10); 
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fixed:=Copy(S,21,10); 

variable:=Copy(S,61,10); 

SAILSID:=Copy(S,77,4); 

FUeID:='FAF'; 
if  not(header='  ')  then 
Writeln(FAFout,  fileID,twosp,header); 
if  header='  '  then 
begin 

Writeln(FAFout,  fileID,twosp,SAILSID,fivesp,fLabel, fixed); 
Writeln(FAFout,  fileID,twosp,SAILSID,fivesp,vLabel, variable); 
end; 

end;  {if  S  = 'WHSECPTY'} 
end;  (while  not  eof FAFfile  1 } 

Close(FAFfile); 

Close(FAFout); 

Writeln; 

WriteCPress  ENTER...'); 

Readln; 


F.2.  CHGFAF 

F.2.1.  Purpose  and  Implementation. 

The  chgfaf.exe  program  is  the  executable  file  created  once  the  following 
Turbo  Pascal  program  chgfaf.pas  is  compiled.  This  program  reads  the  _.faf  file  with  the 
input  root  file  name  and  the  doe.txt  file  found  in  its  resident  subdirectory.  The  doe.txt  file 
is  a  text  file  which  contains  the  new  settings  for  the  fixed  and  variable  costs  in  all  runs  of 
the  chosen  experimental  design.  This  file  is  created  and  formatted  in  Excel  after  the  coded 
design  table  is  obtained  from  JMP. 
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This  program  creates  a  _.faf  file  for  each  run  specified  in  doe.txt  and  names  them 


<root  file  name>l.faf,  <root  file  name>2.faf, <root  file  namexnumber  of  file 
copies>.faf.  These  files  contain  the  cost  settings  as  specified  by  the  design. 

The  executable  file  should  be  invoked  from  the  DOS  prompt  in  its  resident 
subdirectory  with  the  command  “chgfaf  <root  file  name>  <number  of  file  copies>”. 
F.2.2.  Chgfaf  .pas  Code. 

Program  RunChangeFAF; 
uses  CRT; 
var 

RootName  :  string[6]; 
max,  where :  integer; 
logfile ;  text; 

Procedure  ChangeFAF(name  :  string;  max :  integer); 
type 

fixedPtr  =  '^fixedRec; 
fixedRec  =  record 
fileld ;  string[3]; 

SAILSid ;  string[4]; 
costType ;  string[8]; 
costs tr ;  string[10]; 

Prev,  Next ;  fixedPtr; 
end; 

variaPtr  =  ^variaRec; 
variaRec  =  record 
fileld :  string[3]; 

SAILSid :  string[4]; 
costType :  string[8]; 
costStr :  string[5]; 

Prev,  Next :  variaPtr; 
end; 


linePtr  =  ■^lineRec; 
lineRec  =  record 
Stmg :  string[100]; 
Prev,  Next :  linePtr; 
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end; 


var 

fixed :  fixedRec; 
variable ;  variaRec; 

firstFxd,  prevFxd,  currentFxd,  lastFxd :  fixedPtr; 

firstVar,  prevVar,  currentVar,  lastVar ;  variaPtr; 

fixedCst :  real; 

variaCst ;  real; 

fixedS  :  string[10]; 

variaS  :  string[5]; 

oldFile,  newFile,  RunFile  :  text; 

i,j ;  integer; 

inputFName,  newFName :  string[12]; 
fid :  string[3]; 

Sid :  string[5]; 
cstType ;  string[9]; 
line :  lineRec; 

firstLn,  prevLn,  currentLn,  lastLn  :  linePtr; 

Jchr ;  string[2]; 

SAILSchk :  string[6]; 
seekingF,  seekingV :  boolean; 


{•1m  t •^  mIm  4^  4^ 

^  yp  yp  ^p  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  ^  •P  •P  ^  ^  I 

Procedure  InitiateRunFiles(name  :  string;  max  :  integer); 
var 

S  :  string[100]; 
redflag :  boolean; 
j  :  integer; 

begin  {procedure  InitiateRunFiles} 

inputFName:=Concat(name,  '.faf); 

Assign(oldFile,  inputFName); 

Reset(oldFile); 

for  j  :=  1  to  max  do 
begin 

Str(j,  Jchr); 

newFName;=  Concat(name,  Jchr,  ’.faf); 
Assign(newFile,  newFName); 

Rewrite(newFile); 

(Copy  unchanged  portion  of  FAF  file} 
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redflag:=false; 

While  not  redflag  do 
begin 

Readln(oldFile,  S); 
ifS  =  ’WHSECPTY'then 
begin 

redflag  :=tme; 

Writeln(newFile,S); 

Close(newFile); 
if  j  <  max  then 
Reset(oldFile); 
end; 

if  S  <>  ’WHSECPTY’  then 
begin 

Writeln(newFile,  S); 
end; 

end;  {while  not  redflag} 
end;  {forj} 

(read  last  portion  of  oldFile  into  linked  hst  for  modification} 
firstLn  :=  nil; 

While  not  Eof(oldFile)  do 
begin 

if  firstLn  =  nil  then 
begin 

New(currentLn); 
with  currentLn'^  do 
Readln(oldFile,  Stmg); 
currentLn^.Next  :=  nil; 
currentLn^.Prev  :=  nil; 
firstLn  :=  currentLn; 
lastLn  ;=  currentLn; 
end 
else 
begin 

prevLn  :=  lastLn; 

New(currentLn); 
with  currentLn^  do 
Readln(oldFile,  Stmg); 
prevLn-^.Next  ;=  currentLn; 
currentLn^.Next  :=  nil; 
currentLn'^.Prev  :=  prevLn; 
lastLn  :=  currentLn; 
end; 

end;  {while  not  eof} 
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Close(oldFile); 


{print  contents  of  list} 

Writeln(logFile,  'Contents  of  old  list;'); 
currentLn:=firstLn; 
while  currentLnonil  do 
begin 

with  currentLn'"'  do 
Writeln(logFile,  Stmg); 
currentLn;=currentLn'^.next; 
end;  (print  loop) 


end;  (procedure  InitiateRunFiles} 


begin  (ChangeFAF  procedure} 

Assign(logFile,  'pasLog.txt'); 
Rewrite(logFile); 

InitiateRunFiles(name,  max); 

Writeln; 

(Prepare  RunFUe} 

Assign(RunFile,  'doe.txt'); 

for  j:=l  to  max  do 
begin  (for  j:=l  to  max} 

firstFxd:=nil; 

firstVar:=nil; 

(read  data  from  RunFile  for  run  j 
into  fixed  &  variable  lists } 
Reset(RunFile); 
while  not  Eof(RunFile)  do 
begin  (while  not  eof} 
Read(RunFile,  fid,  Sid,  cstType); 
Delete(SId,  1, 1); 
Delete(cstType,  1, 1); 

if  cstType  =  'fixedcst'  then 
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begin  {if  fixed  cost} 

i:=l; 

while  i  <  j  do 
begin 

Read(RunFile,  fixedCst); 

Writeln(logFile,  Sid,  'Reading  past  fixed  cost fixedCst); 
fixedCst:  =0; 
i:=i+l; 
end; 

Readln(RunFile,  fixedCst); 

Wiiteln(logFile,  Sid,  'Set  on  fixed  cost ',  fixedCst); 

Str(Round(fixedCst),  fixedS); 

while  Length(fixedS)  <  10  do 
begin  (while) 
insertC ',  fixedS,  1); 
end; 


if  firstFxd=nil  then 
begin  (if  fixed  list  empty) 
New(currentFxd); 
with  currentFxd''  do 
begin 

costStr:=fixedS; 

fileld:=fld; 

SAILSid:=SId; 

costType:=cstType; 

end; 

currentFxd'^.Next  :=  nil; 
cuiTentFxd''.Prev  :=  nil; 
firstFxd  :=  currentFxd; 
lastFxd  :=  currentFxd; 
end  (if  fixed  list  empty) 
else 

begin  (if  fixed  list  not  empty) 
prevFxd  ;=  lastFxd; 
New(currentFxd); 
with  currentFxd''  do 
begin 

costs  tr:=fixedS; 
fileld:=fld; 
SAILSid;=SId; 
costType;=cstType; 
end; 
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prevFxd^.Next  :=  currentFxd; 
currentFxd'^.Next  :=  nil; 
currentFxd'^.Prev  :=  prevFxd; 
lastFxd  :=  currentFxd; 
end;  { if  fixed  list  not  empty } 
end;  {if  fixed  cost} 


if  cstType  =  'variable'  then 
begin  (if  variable} 
i:=l; 

while  i  <  j  do 
begin 

Read(RunFile,  variaCst); 

Writeln(logFile,  Sid,  'Reading  past  variable  cost ',  variaCst); 
variaCst:  =0; 
i:=i+l; 
end; 

Readln(RunFile,  variaCst); 

Writeln(logFile,  Sid,  'Set  on  variable  cost ',  variaCst); 
Str(Round(variaCst),  variaS); 

while  Length(variaS)  <  5  do 
begin 

InsertC  ',variaS,  1); 
end; 

if  firstVar=nil  then 
begin  (if  variable  list  empty} 

New(currentVar); 
with  current Var^  do 
begin 

costStr:=variaS; 

fileld:=fld; 

SAILSid;=SId; 

costType:=cstType; 

end; 

currentVar^.Next  :=  nil; 
currentVar''‘.Prev  :=  nil; 
firstVar  :=  currentVar; 
lastVar  :=  currentVar; 
end  (if  variable  list  empty} 
else 

begin  (if  variable  list  not  empty} 
prevVar  :=  lastVar; 
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New(currentVar); 
with  currentVar'^  do 
begin 

costStr;=variaS; 

fileld:=frd; 

SAILSid;=SId; 

costType;=cstType; 

end; 

prevVar'^.Next  :=  currentVar; 
currentVar'^.Next  :=  nil; 
currentVar''.Prev  ;=  prevVar; 
lastVar  :=  currentVar; 
end;  { if  variable  list  not  empty } 
end;  {if  variable  cost} 

end;  (while  not  eof} 

{debugging} 
currentFxd:  =firstFxd; 
while  currentFxd  <>  nil  do 
begin 

with  currentFxd^  do 

Writeln(logFile,'j=',j,'F:  ',{fileld,'*',}  SAILSid,'*',  costType,'*',  costStr,'*', 
Length(costStr)); 

currentFxd:  =currentFxd'^.next; 
end;  {fixed  debugging  while  loop} 
current  V  ar : =f ir  stVar; 
while  currentVar  <>  nil  do 
begin 

with  currentVar^  do 

Writeln(logFile;j=',j,'V:  'Jfileld,'*',}  SAILSid,’*',  costType,’*',  costStr,’*', 
Length(costStr)); 

currentV  ar:=currentV  ar'^.next; 
end; {variable  debugging  while  loop} 

{prepare  newFile  for  run  j  for  modified  last  part} 

Str(j,  Jchr); 

newFName;=  Concat(name,  Jchr,  ’.faf); 

Assign(newFile,  newFName); 

Append(newFile); 

currentLn:=firstLn; 
while  currentLn  <>  nil  do 
begin  {while  old  list  not  at  end} 
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with  currentLn^  do 
begin  {with  current  line  of  old  list} 

SAILSchk:=Copy(Stmg,77,4); 
currentFxd:  =firstFxd; 
seekingF:=true; 
currentV  ar:  =fir  stV  ar; 
seekingV:=tme; 

(check  fixed  list) 

while  (currentFxd  <>  nil)  and  (seekingF)  do 
begin 

with  currentFxd^  do 
begin 

if  SAE.Schk=SAILSid  then 
begin 

Delete(Stmg,  21, 10); 

Insert(costStr,  Stmg,  21); 
seekingF:=false; 
end; 

end;  {with  current  fixed  entry} 
currentFxd:=currentFxd''.next; 
end;  {check  of  fixed  list} 

{output  message  fixed  cost  not  used} 
if  seekingF  then 

Writeln(logFile, 'Fixed  cost  for  SAILS  id SADLSchk, '  not  changed.'); 
{check  variable  list} 

while  (currentVar  <>  nil)  and  (seeking V)  do 
begin 

with  currentVar'''  do 
begin 

if  SAILSchk=SAILSid  then 
begin 

Delete(Stmg,  66, 5); 

Insert(costStr,  Stmg,  66); 
seekingV:=false; 
end; 

end;  {with  current  variable  entry} 
currentV  ar;  =currentV  ar''  .next; 
end;  { check  of  variable  list } 

{output  message  variable  cost  not  used} 
if  seekingV  then 

Writeln(logFile,'Variable  cost  for  SAILS  id ',  SAILSchk, '  not  changed.'); 
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Writeln(newFile,  Stmg); 
currentLn : =currentLn'^.next; 


end;  {with  current  line  of  old  list} 

end;  (while  old  list  not  at  end) 

{debugging} 

currentLn:  =firstLn; 
while  currentLnonil  do 
begin 

with  currentLn'^  do 
begin 

Writeln(logFile, old: '); 
Writeln(logFile,  Stmg); 
end; 

currentLn:=currentLn'^.next; 
end;  {old  list  debugging  while  loop} 

Close(newFile); 
end;  {forj:=l  to  max} 


Close(RunFile); 


end;  {ChangeFAF  procedure} 
begin  {Program} 

ClrScr; 

if  ParamCount  <>  2  then 
begin 

Writeln('Input  root  file  name  and  number  of  copies.'); 
Halt; 
end; 


Writeln('In  program'); 
RootName:=  ParamStr(l); 
Val(ParamStr(2),  max,  where); 


F-12 


ChangeFAF(RootName,  max); 

Write('End  program.  Press  ENTER...’); 

Readln; 

end. 

F.3.  FILECOPY 

F.3.1.  Purpose  and  Implementation. 

The  filecopy.exe  program  is  the  executable  file  created  once  the  following  Turbo 
Pascal  program  filecopy.pas  is  compiled.  This  program  copies  all  input  SAILS  files  except 
for  the  _.faf  file  the  specified  number  of  times  and  names  them  <root  file  name>l.xxx, 
<root  file  name>2.xxx, .. .,  <root  file  namexnumber  of  file  copies>.xxx,  where  the  xxx 
represents  the  file  extensions  for  SAILS  input. 

The  executable  file  should  be  invoked  from  the  DOS  prompt  in  its  resident 
subdirectory  with  the  command  “filecopy  <root  file  name>  <number  of  file  copies>”. 

F.3.2.  Filecopy.pas  Code. 

Program  CopyFile; 
uses  CRT; 
type 

fileType  =  (cdf,  ebf,  ecf,  mcf,  mn,  tdf,  sdf,  sgf); 

extType  =  array  [cdf..{,  ebf,  ecf,  mcf,  run,  tdf,  sdf,}  sgf]  of  string[4]; 

var 

SourceFile, 

DestFile :  file; 

RecordsRead :  integer; 

Buffer ;  array  [1..1000]  of  byte; 
newName,  oldName  ;  string  [12]; 

S  :  string  [2]; 
ctr,max, where :  integer; 
extCtr :  fileType; 
extension :  extType; 

begin 
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ClrScr; 

if  ParamCount  <>  2  then 
begin 

Writeln('Input  root  file  name  then  number  of  copies.'); 

Halt; 

end; 

extension[cdf]  :='.cdf ; 
extension[ebf]  :='.ebf ; 
extension[ecf]  ;='.ecf ; 
extension[mcf]  ;='.mcf ; 
extension[run]  :='.mn'; 
extension[tdf]:=  '.tdf ; 
extension[sdf]  ;='.sdf ; 
extension[sgf]  :='.sgf ; 

for  extCtr:=cdf  to  sgf  do 
begin 

oldName:=Concat(ParamStr(l),  extension[extCtr]); 

Val(ParamStr(2),  max,  where); 
for  ctr  :=  1  to  max  do 

begin  {Make  specified  copies  of  this  file  type} 

Str(ctr,S); 

newName:=Concat(ParamStr(l),  S,  extension[extCtr]); 
Assign(SourceFile,  oldName); 

{!-} 

Reset(SourceFile,  1); 

(!+} 

if  lOresult  <>  0  then 
begin 

Writeln('Input  file  not  found.'); 

Halt; 

end; 

WritelnC  .  =  1,000  bytes  copied.'); 

Assign(DestFile,  newName); 

Rewrite(DestFile,  1); 

BlockRead(SourceFile,  Buffer,  SizeOf(Buffer),  RecordsRead); 

while  RecordsRead  >  0  do 
begin 
WriteC.'); 

BlockWrite(DestFile,  Buffer,  RecordsRead); 
BlockRead(SourceFile,  Buffer,  Sizeof(Buffer),  RecordsRead); 
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end; 


Close(SourceFile); 

Close(DestFile); 

end; 


Writeln; 

WriteCPress  ENTER...'); 

Readln; 

end; 

end. 

F.4.  STRIPMIN 

F.4.1.  Purpose  and  Implementation. 

The  stripmin.exe  program  is  the  executable  file  created  once  the  following  Turbo 
Pascal  program,  stripmin.pas,  is  compiled.  This  program  reads  the  _.min  SAILS  output 
files,  with  the  specified  root  file  name,  found  in  its  resident  subdirectory  and  formats 
relevant  fields  (i.e.  the  run  number  and  the  optimal  objective  function  value  for  that  run) 
for  output  to  a  text  file  which  will  be  used  to  provide  response  values  for  the  JMP 
experimental  design  table  generated  for  this  stage  in  CIAT  implementation. 

The  executable  file  should  be  invoked  fi'om  the  DOS  prompt  in  its  resident 
subdirectory  with  the  command  “stripmin  <root  file  name>  <number  of  file  copies>”. 
F.4.2.  Stripmin.pas  Code. 

Program  stripMIN; 

uses  CRT; 

var 

MINfile :  text; 

MINout :  text; 

LogFile :  text; 

S  :  string[180]; 
header :  string[30]; 
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totalCst :  string[20]; 
SAILSID  :  string[10]; 
filelD  ;  string[5]; 
oldName :  string[12]; 
j,  max,  where  :  integer; 
Jchr :  string[2]; 


begin  {program} 

ClrScr; 

if  ParamCount  <>  2  then 
begin 

Writeln('Input  root  file  name  and  number  of  runs.'); 
Halt; 
end; 

Assign(logFile,  'pasLog.txt'); 

Rewrite(logFile); 

Assign(MINout,  'MINout.txt'); 

Rewrite(MINout) ; 

Val(ParamStr(2),  max,  where); 

writeln(logFile,  'In  program'); 


for  j:=l  to  max  do 
begin  {forj:=l  to  max) 

Str(j,  Jchr); 

oldName:=Concat(ParamStr(l),  Jchr,  '.min'); 
Assign(MINfile,  oldName); 

Reset(MINfile); 

write(logFile,  'Reading  file  ', oldName); 
writeln(logFile, ',  j=',j,'  *',jchr,'*'); 

header;=' '; 
repeat 

Readln(MINfile,  S); 
header:=Copy(S,  10,  21); 
until  header='SYSTEMWIDE  TOTAL  COST'; 

Write(MINout,  'Run  ',  Jchr, ':'); 
Writeln(MINout,  copy(S,80,25)); 
write(logFile,  'Run ',  Jchr, ':'); 
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writeln(logFile,  copy(S,80,25)); 

Close(MINFile); 

end;  {for  j:=l  to  max} 
Close(MINout); 

Close(logFile); 

end.  {program} 


F.5.  STRIPM2 


F.5.1.  Purpose  and  Implementation. 

The  stripm2.exe  program  is  the  executable  file  created  once  the  following  Turbo 
Pascal  program,  stripm2.pas,  is  compiled.  This  program  acts  in  exactly  the  same  fashion 
as  the  stripmin.exe  program,  except  that  it  also  collects  the  names  of  all  DCs  with  a 
positive  throughput  in  the  _.min  report  for  each  run. 

The  executable  file  should  be  invoked  from  the  DOS  prompt  in  its  resident 
subdirectory  with  the  command  “stripm2  <root  file  name>  <number  of  file  copies>”. 
F.5.2.  Stripm2. pas  Code. 

Program  stripMIN; 

uses  CRT; 

var 

MlNfile :  text; 

MlNout :  text; 

LogFile :  text; 

S  :  string!  180]; 
header :  string[30]; 
throughtxt :  string!  15]; 
throughput ;  integer; 
totalCst :  string!20]; 

SAILSE) :  string!10}; 
filelD ;  string!5]; 
oldName :  string!12]; 
j,  max,  where  :  integer; 
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Jchr :  string[2]; 


begin  {program} 

ClrScr; 

if  ParamCount  <>  2  then 
begin 

Writeln('Input  root  file  name  and  number  of  runs.'); 
Halt; 
end; 

Assign(logFile,  'pasLog.txt'); 

Rewrite(logFile); 

Assign(MINout,  'MINouttxt'); 

Rewrite(MINout); 

Val(ParamStr(2),  max,  where); 

writeln(logFile,  'In  program'); 


for  j;=l  to  max  do 
begin  (forj:=l  to  max} 

Str(j,  Jchr); 

oldName;=Concat(ParamStr(l),  Jchr,  '.min'); 
Assign(MINfile,  oldName); 

Reset(MINfile); 

write(logFile,  'Reading  file  ', oldName); 
writeln(logFile, ',  *',jchr,'*'); 

header:=' '; 
repeat 

Readln(MINfile,  S); 
header;=Copy(S,  10, 21); 
until  header='SYSTEMWIDE  TOTAL  COST'; 

Write(MINout,  'Run  ',  Jchr, ':'); 

Write(MINout,  copy(S,80,25)); 
write(logFile,  'Run ',  Jchr, ':'); 
writeln(logFile,  copy(S, 80,25)); 
repeat 

Readln(MINfile,  S); 
header:=Copy(S,  1, 17); 
until  header='  REPORT  #:  56-05A'; 
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writeln(logFile,  header); 

Readln(MINfile,  S); 

Readln(MINfile,  S); 

Readln(MINfile,  S); 

Readln(MINfile,  S); 

repeat 

Readln(MINfile,  S); 
header: =Copy(S,  1,  20); 
throughtxt:=Copy(S,  25, 14); 
write(logFile,  header); 
write(logFile,  throughtxt); 

Val(throughtxt, throughput, where); 
writeln(logFile,  '*',throughput); 
if  throughput  >  0  then  write(MINout,  header); 
until  header='0  TOTALS:  '; 

writeln(MINout); 

Close(MINFile); 

end;  {forj:=l  to  max} 

Close(MINout); 

Close(logFile); 


end.  {program} 
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